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MAMMALIAN CAAX PROCESSING ENZYMES 

BACKGROUND OF THE I NVENTION 
The eukaryotic cell is made up of a wide variety 
of specialized structures and membrane-enclosed 
5 compartments which create special intracellular 

environments. Many cellular components/ such as proteins/ 
must be targeted to particular intracellular locations for 
proper function. The lipid bilayer membrane that 
surrounds the cell is one such specialized cellular 
10 environment. Cellular components such as proteins are 

delivered to and must associate in a particular way with 
the correct membrane surface of the lipid bilayer. A 
number of mechanisms which the cell has evolved for 
targeting proteins to and inserting proteins into the 
15 lipid bilayer membrane of the cell have now been 
described. 

Proteins targeted to the lipid bilayer membrane 
may be situated within the lipid bilayer membrane (i.e., a 
portion of the protein is inserted into and across the 

20 lipid bilayer) and are typically referred to as "integral 
membrane proteins''. Integral membrane proteins possess 
intrinsic hydrophobic regions (comprising amino acid 
regions of relatively high intrinsic or apparent 
hydrophobicity) which are often inserted into the lipid 

25 bilayer as the protein is being synthesized. 
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Alternatively, proteins targeted to the lipid bilayer 
membrane may be situated at and associate with the 
membrane surface without being inserted across the 
membrane. Such proteins are typically referred to as 
5 "peripheral membrane proteins" . 

Peripheral membrane proteins are usually less 
tightly associated with the lipid bilayer than integral 
membrane proteins. Peripheral membrane proteins may 
associate with the lipid bilayer by protein-protein 
10 interactions, by hydrophobic interactions between the 

bilayer and one or more intrinsically hydrophobic regions 
of the membrane protein, or by the co- or post- 
translational addition of a lipid group to the membrane 
protein. The most recently discovered lipid modification 
15 involves the covalent attachment of cholesterol 

biosynthetic intermediates to proteins, and is referred to 
generically as protein prenylation. See, e.g., W.R. 
Schafer and J. Rine, ftnnu. Rev. Genet., 26, p. 209 (1992); 
s . Clarke, &nm *™ - Binrhem.. 61, p. 355 (1992); each of 
20 which is incorporated herein by reference. 

One class of peripheral membrane proteins 
processed by the latter mechanism has a terpenoid lipid 
added to the .C-terminal end for such purposes. Proteins 
belonging to this class (often referred to as prenylated 
25 proteins) that possess a C-terminal tetrapeptide CAAX 

motif (where C is cysteine, A is usually aliphatic, and X 
can be a variety of different residues) undergo a series 
of post-translational processing reactions. These 
reactions include the attachment of either a 15-carbon 
30 farnesyl or a 20-carbon geranylgeranyl lipid moiety to the 
cysteine residue of the CAAX motif (prenylation) , 
proteolytic removal of the C-terminal three amino acids 
(AAX) and methylesterif ication of the a-carboxyl group of 
the terminal prenylated cysteine residue. 
35 Prenylation of CAAX protein substrates is 

catalyzes by one of two different type I protein prenyl- 

f 
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transferases. If the X position is occupied by leucine or 
phenylalanine, the cysteine residue of the protein is 
geranylgeranylated. If the X position is occupied by any 
other amino acid, the cysteine residue of the protein is 
5 f arnesylated. See V. L. Boyartchuk et al., Science* 275, 
pp. 1796-1800 (1997) and M.N. Ashby, Current Opinion in 
T, ipidol, , 9, pp. 99-102 (1998) (incorporated herein by 
reference) . 

Prenylated proteins include, for example, many 

10 proteins involved in signal transduction such as the Ras 
oncogenes, fungal pheromones and the y subunits of 
trimeric guanosine triphosphate (GTP) -binding proteins (G 
proteins) . They also include a wide range of small GTP- 
binding proteins that participate in vesicular 

15 trafficking. See, e.g, P. J. Casey, Qui, Qpin. Cell, 

Riol. , 6, p. 219 (1994); J.E. Rothman and F. T. Wieland, 
c^ipnce , 272, p. 227 (1996). Prenylation is critical for 
the activity of many of these proteins. 

The yeast Saccharomyces cerevisiae possesses 

20 numerous CAAX-containing proteins including, but not 

limited to, Raslp, Ras2p, Mfalp, Mfa2p, and Stel8p. Yeast 
have two genes encoding prenylation-dependent CAAX 
endoproteinases, AFC1 and RCE1 (Boyartchuk et al., siiDTJa) • 
The Afcl protein (Afclp; encoded by AFC1) is a zinc 

25 protease that participates in the processing of the yeast 
mating pheromone a-f actor. The Reel protein (Rcelp; 
encoded by RCE1) contributes to the processing of &-f actor 
and the yeast Ras protein (Ras2p) . Rcelp is only a minor 
contributor in vivo to ^factor processing but appears to 
30 be required for Ras2p processing. Yeast have a single 

gene, STE14, which encodes the membrane-associated, CAAX 
C-terminal methyltransf erase (Stel4p) (Ashby et al., 
Yeast , 9, p. 907 (1993) (incorporated herein by 
reference) ) . 

35 Farnesylation of the Ras protein of yeast and 
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humans is required for proper membrane localization. The 
Ras protein in normal cells contributes to the orderly 
cell divisions needed during growth and development. Ras 
proteins are attached to the inner surface of the lipid 
5 bilayer membrane that surrounds each cell. Ras proteins 
function as a primary intracellular signaling mechanism 
for cell proliferation. They respond to external stimuli 
such as growth factors and hormones which bind to the cell 
membrane and trigger an intracellular cascade of chemical 
10 reactions known generally as -signal transduction". These 
chemical signals lead to the cell nucleus, where they 
activate gene transcription and the process of cell 
division. When the extracellular stimulus stops, the 
internal signaling pathway is deactivated and the cell 
15 returns to its resting state. 

When dominant mutations occur in the Ras gene 
(turning it into a Ras oncogene) , mutant proteins are 
produced that lock the signaling pathway in an active 
state, resulting in uncontrolled cell division. Dominant 
20 mutations in the Ras gene are presently the second most 

common type of mutations in human cancers. They are found 
in more than 50% of all human cancers including pancreatic 
cancer (95%), colon cancer (50%), lung cancers (45%) and 

leukemias (30%) . 
25 The functional activity of Ras protein depends 

on its attachment to the cell membrane, which in turn is 
mediated by protein prenylation. Farnesylation of the Ras 
protein in yeast and humans is required for its oncogenic 
transforming activity. Because the Ras protein is 
30 modified sequentially by the three different processing 
enzymes described above (which farnesylate, cleave and 
methylate the C-terminal CAAX motif) , each of these 
processing enzymes represents a potential target that 
could be attacked pharmacologically to block the oncogenic 
35 action of mutant Ras protein. 

Protein farnesyl transferases have become an 
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active target for drug development. Inhibitors of protein 
farnesyl transferases are being developed, for example/ as 
potential therapeutics for Ras-based tumors. See, e.g., 
F. Tamanoi, Trends Biochem. Sci . , 18, p. 349 (1993); N. E. 

5 Kohl et al., J- Cell. Biochem, Suool , , 22. p. 145 (1995). 
The methyltransf erase enzyme will also be an attractive 
target for drug development. 

The mammalian counterparts to the yeast CAAX 
endoproteinase and methyltransf erase genes have not been 

10 isolated. The mammalian homologs to each of the three 

yeast CAAX processing enzymes each represent a potential 
target to block the oncogenic action of mutant Ras protein 
in tumors, or more generally, to modulate the activity of 
prenylated peripheral membrane proteins. It would thus be 

15 desirable to identify and to isolate the mammalian 
homologs to the yeast CAAX endoproteinase and 
methyltransf erase genes. 

SUMMARY OF THE INVENTION 
The present invention provides mammalian DNA 
20 sequences that display a high degree of sequence identity 
to their counterpart genes in Saccharomyces cerevisiae, 
RCE1 and AFC1, encoding prenylation-dependent CAAX 
endoproteinases . Specifically, complementary DNA (cDNA) 
sequences of the human and mouse RCE1 functional homologs 
25 are provided. Human cDNA sequences encoding proteins 

having a high degree of amino acid sequence identity to 
the yeast Afclp protein are also provided. This invention 
is also directed to recombinant DNA molecules comprising 
the mammalian DNA sequences, DNA molecules and antisense 
30 RNA molecules which hybridize under stringent 

hybridization conditions to those DNA sequences, hosts 
transformed with the recombinant DNA molecules and protein 
expression products produced by culturing the transformed 
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hosts. Antibodies directed against the protein expression 
products are also provided. Also provided are assays to 
identify inhibitors of one or more mammalian CAAX 
processing enzymes and kits for making the above products 
5 and performing the above assays. Finally, this invention 
provides pharmaceutical compositions comprising an 
inhibitor of a mammalian CAAX processing enzyme, and 
methods for treating a CAAX-protein mediated disease or 
disorder in a patient by administering such a 
10 pharmaceutical composition. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1A. Amino acid sequence alignment of yeast RCE 
with a human EST. 

Figure IB. Amino acid sequence alignment of yeast, AFC1 
15 (NAG2) with a human EST. 

Figure 1C. Amino acid sequence alignment of yeast STE14 
with a mouse EST. 

Figure 2A. Human hRCE cDNA (SEQ ID NO:l) and predicted 
protein sequences (SEQ ID NO:2). 

20 Figure 2B. Mouse mRCE cDNA (SEQ ID NO: 3) and predicted 
protein sequences from the first in-frame ATG codon (SEQ 
ID NO: 4) . 

Figure 2C. Mouse jilRCE cDNA sequence with additional 5' 
genomic sequence relative to Figure 2B (SEQ ID NO: 5). Tl 
25 first in-frame ATG codon is underlined. The sequence 
obtained exclusively from the mouse genomic clones is 
represented from nucleotides 1-152 (SEQ ID N0:5). See 
(SEQ ID NO: 6) for the predicted protein sequence. 
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Figure 3. Amino acid sequence alignment of the yeast 
Rcelp protein (RCE1.PR0), the human hRCE protein 
(HRCE1.PR0) and the mouse mRCE (MRCEORF. PRO) protein. The 
alignment utilized the Clustal method with PAM250 residue 
5 weight table. 

Figure 4. Northern blots of hRCE mRNA expression levels 
in various human tissues. The source tissue for the 
poly(A) RNA is indicated along the top, and the size 
standards (kbp) along the left side of each blot. The 
10 hRCE message appears to be approximately 1.4 kbp and was 
expressed in all tissues examined. 

Figure 5. Primer extension analysis of hRCE mRNAs. Human 
placental and pancreatic poly (A) RNA was hybridized to a 
hRCE specific primer and extended using reverse 

15 transcriptase. The extension products were resolved by 
polyacrylamide gel electrophoresis, along with the 
products of a sequencing reaction to serve as size 
standards. The transcriptional start sites appear to be 
in the region from -10 to -26 (relative to the valine 

20 codon GTG at position 116-118; Figure 2A) . 

Figure 6. Yeast halo assays of wild-type strain (W303) 
(MAT- a; leu2-3,112; ade2-101; his3-ll; trpl-1; ura3-l) , a 
strain lacking the AFC1 gene (Aafcl), a strain lacking the 
RCE1 gene (Arcel), and a strain lacking both AFC1 and RCE1 

25 (AA) are shown in the top four panels. Halo assays of a 
strain lacking both AFC1 and RCE1 genes expressing either 
no human protein (AApJPl), hRCE expressed from the yeast 
MEV1 promoter (AApACA7001) , or hRCE expressed from the 
yeast ERG12 promoter (AApACA7002) are shown in the 

30 lower three panels. Halos are present when hRCE is 

expressed in yeast indicating that the human protein is 
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functional in yeast (see Example 3). 

Figure 7. Human hAFC cDNA and protein sequences. 

Figure 8. Amino acid sequence alignment between the 
full length yeast Afclp protein (YAFC1.PRO) and the 
5 human hAFC protein (HAFCl.PRO). The alignment utilized 
the Clustal method with PAM250 residue weight table. 

Figure 8. Amino acid alignment between the full length 
yeast Afclp protein (YAFC1.PR0) and the human hAFC protein 
(HAFC1.PR0). The alignment utilized the Clustal method 
LO with PAM250 residue weight table. 

Figure 9. Amino acid alignment between a region of the 
human hRCElp protein (SEQ ID NO: 2, amino acid residues 
166-233) comprising the histidine-rich sequence, HxHH (SEQ 
ID NO:2, amino acid residues 172-175) and nucleic acid 
15 sequences in GenBank. The alignment utilized the Clustal 
method with PAM250 residue weight table. 

Figure 10. Yeast halo assays performed with the following 
strains: wild-type strain (W303) (MAT-a; leu2-3,112; ade2- 
101; his3-ll; trpl-1; ura3-l); a strain lacking the RCE1 
20 gene (Arcel); a strain lacking both AFC1 and RCE1 (Aafcl, 
Arcel); a strain transformed with a URA3* marked, high-copy 
(2u) plasmid carrying wildtype hRCE expressed from the 
yeast ERG 12 promoter (pAB13 wild-type) ; a strain 
transformed with a pAB13-derived plasmid in which the 
25 histidine at position 172 of hRCE was changed to an 

alanine (pABIS H172A) ; a strain transformed with a pAB13- 
derived plasmid in which the histidine at position 174 of 
hRCE was changed to an alanine (pABl€ H174A) ; a strain 
transformed with a pAB13-derived plasmid in which the 



SUBSTITUTE SHEET (RULE 26) 



WD 98/54333 PCT/US98/11415 

- 9 - 

histidine at position 175 of hRCE was changed to an 
alanine (pAB17 H17 5A) . Halos are present when hRCE is 
expressed in yeast indicating that the human protein is 
functional in yeast (see Example 3) . 

5 Figure 11. Heat shock assays in wild-type yeast cells 

(W303) (MATa; leu2-3,112; ade2-101; his3-ll; trpl-1; ura3- 
1) transformed with plasmid pAB20 expressing the activated 
RAS2-val 19 gene (dominant RAS2 allele) pWT") ; and in a 
W303-derived strain lacking both the Rcelp and Afclp CAAX 
10 proteases (drcel, Aafcl) co-transformed with one of the 
following URA3 marked, high-copy (2u) plasmids: pABl 
(vector control), pAB13 (hRCEl) , pAB15 (H172A) , pAB16 
(H174A) ; and pAB17 (H175A) (as described in Figure 10) . 
Percent (%) viability after heat shock treatment is shown 
15 for each strain (see Example 9) . 

PRIMER SEQUENCES 

Oligonucleotides or "primers" were all purchased 
from Operon Technologies (Alameda, CA) and consisted of 
the following sequences: hRCE(lR)-RI (SEQ ID NO:7) CGG 
20 AAT TCT CAA TAA TAT GGT GAA AAT GGG C; hRCE(2R)-RI (SEQ ID 
NO: 8) CGG AAT TCG GAT GAA GAG GAA AGC AGT G; hRCE(3F)-RI 
(SEQ ID NO: 9) GCG AAT TCT TCG GTG CCT ACA CTG CTT TC; 
hRCE(4F)-RI (SEQ ID NO: 10) GCG AAT TCC CAT TTT CAC CAT ATT 
ATT GAG C; hRCE (5R) (SEQ ID NO: 11) CTA ACA TGG GCA GCA TAC 
25 AGG; hRCE ( 6R) (SEQ ID NO: 12) CTG GTG AAG CGT CGC TTG A; 
hRCE (7 F) (SEQ ID NO: 13) ATC CAG CCA GGC ACA TCC; 
hRCE (8 F) (SEQ ID NO: 14) TTG CCT CAC AGA CAT GCG TTG; 
hRCE (9F) (SEQ ID NO: 15) AAC GCA TGT CTG TGA GGC AAC; 
hRCE(10R)-RI (SEQ ID NO: 16) CGG AAT TCG CTA ACA TGG GCA 
30 GCA TAC AGG CC; hRCE(HR) (SEQ ID NO: 17) GGT CAG GAG CAC 
AGG GGA GC; hRCE(12F) (SEQ ID NO: 18) GAA GGG CTG GCT GGG 
GTC; hRCE(13R) (SEQ ID NO: 19) GGT CCC TGG GAC GCA ACT AC- 
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hRCE(HF) (SEQ ID NO:20) CGG GGC GGC GCT GGG CGG GGA TGG 
GC; hRCE(15R) (SEQ ID NO: 21) AGG CTG CCC ACG TAG GAG CAG 
GCG AGG CTG A, hRCE-ValX (SEQ ID NO:22) GTG CTC TAG ACT 
GTC AGT GTT CTC CTG CCT CAG CCT C; hRCE3P-RI (SEQ ID 
5 NO:23) CCG GGA ATT CAG GAG CAC AGG GGA GCT TGT GAG TCC C; 
hRCE3P-Xba (SEQ ID NO:24) CCG GTC TAG AGG AGC ACA GGG GAG 
CTT GTG AGT CCC; 608(1F)-RI (SEQ ID NO:25) CGG AAT TCC CTG 
CTG TGT TCA CCT GCC CA; 608(2F)-RI (SEQ ID NO: 26) CGG AAT 
TCC TCC TAC ACC GCT GTC TTC GGT G; 608(1R)-RI (SEQ ID 
10 NO:27) CGG AAT TCC CAG CGC ATG TCT GTG AGG CAG; 608(2R)-RI 
(SEQ ID NO: 28) CGG AAT TCT GGG CAG GTG AAC ACA GCA GG; 
hAFC (1R) ~RI (SEQ ID NO:29) GCG AAT TCC TGG AAG ATG ATC AAT 
AGT CCA AT; hAFC (2R) (SEQ ID NO: 30) TCT GGG CTT TTC TGT ACT 
TAA TT; hAFC (3F) (SEQ ID NO: 31) GGA ACA GTG AAG AAA TAA AAG 
15 CTA AAG; hAFC (4F) (SEQ ID NO: 32) TGA GGA GGT ACT CGC TGT 

ACT AGG; hSTE14(lF)-RI (SEQ ID NO: 33) GCG AAT TCG AGA ACA 
TCT TCT GGC CAG, hSTE14 (2R) -RI (SEQ ID NO:34) GCG AAT TCT 
GGC CAG AAG ATG TTC TCG; hSTE14 (3F) (SEQ ID NO: 35) CTA CCA 
GAT AGC CAT CAG AGC; hSTE14 (4R) (SEQ ID NO: 36) GCT CTG ATG 
20 GCT ATC TGG TAG. 

Definitions 

Unless otherwise defined, all technical and 
scientific terms used herein have the meaning as commonly 
understood by one of ordinary skill in the art to which 
25 this invention belongs . The practice of the present 
invention employs, unless otherwise indicated, 
conventional techniques of chemistry, molecular biology, 
microbiology, recombinant DNA, genetics and immunology. 
See, e.g., Maniatis et al., 1982; Sambrook et al., 1989; 
30 Ausubel et al., 1992; Glover, 1985; Anand, 1992; Guthrie 
and Fink, 1991 (which are incorporated herein by 
reference) . 

An "isolated" or "substantially pure" nucleic 
acid, polynucleotide (e.g., an RNA, DNA or a mixed 
35 polymer) or polypeptide is one which is substantially 
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separated from other cellular components that naturally 
accompany the native sequence or protein in its natural 
host cell, e.g., ribosomes, polymerases, many other 
mammalian genome sequences and proteins. The term 
5 embraces a nucleic acid sequence or polypeptide that has 
been removed from its naturally occurring environment, and 
includes recombinant or cloned DNA isolates and chemically 
synthesized analogs or analogs that are biologically 
synthesized by heterologous systems. 
10 The term "nucleic acids" or "polynucleotides" is 

also extended to refer to nucleic acids that encode a RCE1 
or AFC1 polypeptide, polypeptide fragment, homologs and 
variants of those polypeptides, and protein fusions and 
deletions of any of the above. These nucleic acids 
15 comprise a sequence which is either derived from, or 

substantially similar to a natural CAAX protease-encoding 
gene or one having substantial homology with a natural 
CAAX protease-encoding gene or a portion thereof. 

The polynucleotide compositions of this 
20 invention may include both sense and antisense strands of 
RNA, cDNA, genomic DNA, and synthetic forms and mixed 
polymers of the above. They may be modified chemically or 
biochemically or may contain non-natural or derivatized 
nucleotide bases, as will be readily appreciated by those 
25 of skill in the art. Such modifications include, for 

example, labels, methylation, substitution of one or more 
of the naturally occurring nucleotides with an analog, 
internucleotide modifications such as uncharged linkages 
(e.g., methyl phosphonates, phosphotriesters, 
30 phosphoramidates, carbamates, etc.), charged linkages 
(e.g., phosphorothioates, phosphorodithioates, etc.), 
pendent moieties (e.g., polypeptides), intercalators 
(e.g., acridine, psoralen, etc.), chelators, alkylators, 
and modified linkages (e.g., alpha anomeric nucleic acids, 
35 etc.) Also included are synthetic molecules that mimic 

polynucleotides in- their ability to bind to a designated — 
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sequence via hydrogen bonding and other chemical 
interactions. Such molecules are known in the art and 
include, for example, those in which peptide linkages 
substitute for phosphate linkages in the backbone of the 

5 molecule. . 

The term "substantial homology or similarity" 
when referring to a nucleic acid or fragment thereof 
indicates that, when optimally aligned with appropriate 
nucleotide insertions or deletions with another nucleic 
10 acid (or its complementary strand) , there is nucleotide 

sequence identity in at least about 60% of the nucleotide 
bases, usually at least about 70%, more usually at least 
about 80%, preferably at least about 90%, and more 
preferably at least about 95-98% of the nucleotide bases. 
■ Alternatively, substantial homology or 

similarity exists when a nucleic acid or fragment thereof 
hybridizes to another nucleic acid, to a strand of another 
nucleic acid, or to the complementary strand thereof, 
under selective hybridization conditions. Typically, 
20 selective hybridization will occur when there is at least 
about 55% homology - preferably at least about 65%, more 
preferably at least about 75%, and most preferably at 
least about 90% — over a stretch of at least about 14 
nucleotides. See, e.g., Kanehisa, 1984. The length of 
25 homology comparison, as described, may be over a stretch 

of at least about nine nucleotides, usually at least about 
20 nucleotides, more usually at least about 24 
nucleotides, typically at least about 28 nucleotides, more 
typically at least about 32 nucleotides, and preferably at 
30 least about 36 or more nucleotides. 

Nucleic acid hybridization will be affected by 
such conditions as salt concentration, temperature, 
solvents, the base composition of the hybridizing species, 
length of the complementary regions, and the number of 
35 nucleotide base mismatches between the hybridizing nucleic 
acids, as will be readily appreciated by those skilled in. 
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the art. 

"Stringent hybridization" and "stringent 
hybridization wash conditions" in the context of nucleic 
acid hybridization experiments differ depending on 
5 different physical parameters such as temperature/ the 
base composition and concentration of the reactants and 
the salt concentration. Stringent temperature conditions 
will generally include temperatures in excess of 30°C, 
typically in excess of 37°C, and preferably in excess of 

10 45°C. Stringent salt conditions will ordinarily be less 
than 1000 mM, typically less than 500 mM, and preferably 
less than 200 mM. However, the combination of parameters 
is much more important than the measure of any single 
parameter. See, e.g., Wetmur and Davidson, 1968. Probe 

15 sequences may also hybridize specifically to duplex DNA 
under certain conditions to form triplex or other higher 
order DNA complexes. The preparation of such probes and 
suitable hybridization conditions are well known in the 
art. 

20 Generally, stringent hybridization and wash 

conditions are selected to be about 5°C lower than the 
thermal melting point (T m ) for the specific sequence at a 
defined ionic strength and pH. The T m is the temperature 
at which 50% of the target sequence hybridizes to a 

25 perfectly matched probe. An example of stringent 
hybridization conditions for hybridization of 
complementary nucleic acid sequences having more than 100 
complementary residues on a filter in a Southern or 
northern blot is 50% formamide with 1 mg of heparin per 

30 50mL at 42°C for at least ten hours. An example of 

stringent wash conditions is a 0.2x SSC wash at 65°C for 
15 minutes (see Sambrook et al., supra , for SSC buffer). 
Often the high stringency wash is preceded by a low 
stringency wash to remove excess probe. An exemplary 

35 medium stringency wash for duplex DNA of more than 100 

base pair^is.-lxSSC at 45°C for 15 minutes. An ex empl a ry . 
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low stringency wash for such a duplex is 4xSSC at 40°C for 
15 minutes. In general, signal-to-noise ratio of 2x or 
higher than that observed for an unrelated probe in the 
particular hybridization assay indicates detection of a 
5 specific hybridization. 

As defined herein, nucleic acids which do not 
hybridize to each other under stringent conditions are 
still substantially identical to those which do hybridize 
under stringent conditions if the polypeptides which they 
10 encode are substantially identical. This occurs, for 

example, when a nucleic acid is created synthetically or 
recombinantly using a high codon degeneracy as permitted 
by the redundancy of the genetic code. 

The term "identical" in the context of nucleic 
15 acid or polypeptide sequences refers to the residues in 
the two sequences which are the same when aligned for 
maximum correspondence. When percentage of sequence 
identity is used in reference to proteins or peptides, it 
is recognized that residue positions which are not 
20 identical often differ by conservative amino acid 

substitutions — where amino acid residues are substituted 
for other amino acid residues having side chains with 
similar chemical properties (e.g., charge or 
hydrophobicity) which do not substantially change the 
25 functional properties of the molecule. Where sequences 

differ in conservative substitutions the percent sequence 
identity may be adjusted upwards to correct for the 
conservative nature of the substitution. Means for making 
this adjustment are well known to those of skill in the 
30 art (see, e.g., Pearson et al., Mpfhods in Modular 
Biology . 24, pp. 307-31 (1994)). 

The terms "substantial homology" or "substantial 
identity", when referring to polypeptides, indicate that 
the polypeptide or protein in question exhibits at least 
35 about 30% identity with an entire naturally-occurring 

protein or a portion thereof, usually at least-about-70% 

SUBSTITUTE SHEET (RULE 26) 



WO 98/54333 PCT/US98/11415 

- 15 - 

identity, and preferably at least about 95% identity. 

The term "substantially similar function" refers 
to the function of a modified nucleic acid or a modified 
polypeptide (or protein) with reference to its wild-type 
5 nucleic acid or polypeptide counterpart- The modified 

CAAX protease polypeptide will be substantially homologous 
to the wild-type CAAX protease polypeptide and will have 
substantially the same function. The modified polypeptide 
may have an altered amino acid sequence and/or may contain 
10 modified amino acids. 

In addition to the similarity of function, a 
modified CAAX protease polypeptide may have other useful 
properties, such as an altered (e.g., longer) half-life. 
The similarity of function (activity) of the modified 
15 polypeptide may be substantially the same as the activity 
of the wild-type CAAX protease polypeptide. 
Alternatively, the similarity of function (activity) of 
the modified polypeptide may differ from the activity of 
the wild-type polypeptide in, for example, substrate 
20 specificity and/or enzymatic activity. The modified 

polypeptide is synthesized using conventional techniques, 
or is encoded by a modified nucleic acid and produced 
using conventional techniques. The modified nucleic acid 
is prepared by conventional techniques. A nucleic acid 
25 with a function substantially similar to the wild-type 

RCE1 or AFC1 gene function produces the modified protein 
as described above. 

Homology, for polypeptides, is typically 
measured using sequence analysis software. See, e.g., the 
30 Sequence Analysis Software Package of the Genetics 

Computer Group, University of Wisconsin Biotechnology 
Center, 910 University Avenue, Madison, Wisconsin 53705. 
Protein analysis software matches similar sequences using 
measure of homology assigned to various substitutions, 
35 deletions and other modifications. The length of 
polypeptide sequences compared for homology will ge n e r all y 
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be at least about 16 amino acids, usually at least about 
20 residues, more usually at least about 24 residues, 
typically at least about 28 residues, and preferably more 
than about 35 residues. 

"Conservatively modified variations" of a 
particular nucleic acid sequence refers to nucleic acids 
that encode identical or essentially identical amino acid 
sequences or DNA sequences where no amino acid sequence is 
encoded, because of the degeneracy of the genetic code, a 
large number of functionally identical nucleic acids 
encode any given polypeptide sequence. When a nucleic 
acid sequence is changed at one or more positions with no 
corresponding change in the amino acid sequence which it 
encodes, that mutation is called a "silent mutation". 
15 Thus, one species of a conservatively modified variation 
according to this invention is a silent mutation. 
Accordingly, every nucleic acid sequence herein which 
encodes a polypeptide also describes every possible silent 
mutation or variation. 

Furthermore, one of skill in the art will 
recognize that individual substitutions, deletions, 
additions and the like, which alter, add or delete a 
single amino acid or a small percentage of amino acids 
(less than 5%, more typically less than 1%) in an encoded 
25 sequence are "conservatively modified variations" where 
the alterations result in the substitution of one amino 
acid with a chemically similar amino acid. Conservative 
substitution tables providing functionally similar amino 
acids are well known in the art. The following six groups 
each contain amino acids that are conservative 
substitutions for one another: 



20 



30 



1) Alanine (A), Serine (S) , Threonine (T) ; 

2) Aspartic Acid (D) , Glutamic Acid (E) ; 

3) Asparagine (N) , Glutamine (Q) ; 

4) Arginine (R)-,— Lysine— {JQ-; 



SUBSTITUTE SHEET (RULE 26) 



WD 98/54333 PCTAJS98/11415 

- 17 - 

5) Isoleucine (I), Leucine (L) , Methionine (M) / 
Valine (V) , and 

6) Phenylalanine (F), Tyrosine (Y) , Tryptophan (W) . 

The terms "RCE1 (or AFC1) protein" or "RCE1 (or 
5 AFC1) polypeptide" refer to a protein or polypeptide 
encoded by the RCE1 (or AFC1) locus, and variants or 
fragments thereof. The term "polypeptide" refers to a 
polymer of amino acids and its equivalent and does not 
refer to a specific length of the product; thus, peptides, 

10 oligopeptides and proteins are included within the 

definition of a polypeptide. This term also does not 
exclude modifications of the polypeptide, for example, 
glycosylations, acetylations, phosphorylations, and the 
like. Included within the definition are, for example, 

15 polypeptides containing one or more analogs of an amino 
acid (including, e.g., unnatural amino acids), 
polypeptides with substituted linkages as well as other 
modifications known in the art, both naturally and 
non-naturally occurring. Ordinarily, such polypeptides 

20 will be at least about 50% homologous to the native CAAX 

protease sequences, preferably in excess of about 90%, and 
more preferably at least about 95% homologous. Also 
included are proteins encoded by DNA which hybridize under 
high or low stringency conditions, to CAAX 

25 protease-encoding nucleic acids and closely related 

polypeptides or proteins retrieved by antisera raised 
against such protein (s). 

A polypeptide "f ragment", "portion" or "segment" 
refers to a stretch of amino acid residues of at least 

30 about five to seven contiguous amino acids, often at least 
about seven to nine contiguous amino acids, typically at 
least about nine to 13 contiguous amino acids and, most 
preferably, at least about 20 to 30 or more contiguous 
amino acids. 
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The term "protein modifications" refers to RCE1 
(or AFC1) polypeptides or fragments thereof that are 
substantially homologous in primary structural sequence 
but which include, e.g., in vivo or in vitro chemical and 
5 biochemical modifications or which incorporate unusual 
amino acids. Such modifications include, for example, 
acetylation, carboxylation, phosphorylation, 
glycosylation, ubiquitination, labeling, e.g., with 
radionuclides, and various enzymatic modifications, as 
10 will be readily appreciated by those well skilled in the 
art. A variety of methods for labeling polypeptides and 
of substituents or labels useful for such purposes are 
well known in the art, and include radioactive isotopes 
such as 32 P, ligands which bind to labeled antiligands 
15 (e.g., antibodies), f luorophores, chemi luminescent agents, 
enzymes, and antiligands which can serve as specific 
binding pair members for a labeled ligand. The choice of 
label depends on the sensitivity required, ease of 
conjugation with the primer, stability requirements, and 
20 available instrumentation. Methods for labeling 

polypeptides are well known in the art. See Sambrook et 
al., 1989 or Ausubel et al., 1992. 

The term "fusion protein" refers to fusion 
polypeptides comprising RCE1 or AFC1 polypeptides and 
25 fragments. Homologous polypeptide fusions may be between 
two or more polypeptide sequences or between sequences 
derived from RCElp or AFClp and a related protein. 
Heterologous fusions may be constructed which would 
exhibit a combination of properties or activities of the 
30 derivative polypeptides. For example, active site or 
other domains may be "swapped" between CAAX protease 
polypeptides and other polypeptides or fragments. Such 
homologous or heterologous fusion polypeptides may display 
altered biological characteristics (e.g., altered 
35 enzymatic activity, substrate specificity and/or altered 
cellular localization) . — - 
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The terms "isolated", "substantially pure", 
"purified", "purified and isolated" and "substantially 
homogeneous " are used interchangeably to describe a 
protein or polypeptide that has been separated from 
5 components that accompany it in its natural state. A 
monomeric protein is substantially pure when at least 
about 60 to 75% of a- sample exhibits a single polypeptide 
sequence. A substantially pure protein will typically 
comprise about 60 to 90% W/W of a protein sample, more 
10 usually about 95% , and preferably will be over 99% pure. 

Protein purity or homogeneity may be indicated by a number 
of means well known in the art, such as polyacrylamide gel 
electrophoresis of a protein sample, followed by 
visualizing a single polypeptide band upon staining the 
15 gel with a stain well known in the art. For certain 

purposes, higher resolution may be provided by using HPLC 
or other means well known in the art for purification. 

A RCE1 or AFC1 protein is substantially free of 
naturally associated components when it is separated from 
20 the native contaminants that accompany it in its natural 

state. Thus, a polypeptide that is chemically synthesized 
or synthesized in a cellular system different from the 
cell from which it naturally originates will be 
substantially free from its naturally associated 
25 components. A protein may also be rendered substantially 
free of naturally associated components by isolation, 
using protein purification techniques well known in the 
art. 

Besides substantially full-length polypeptides, 
30 the present invention provides for biologically active 
fragments of the polypeptides. Significant biological 
activities include CAAX substrate binding, protease 
activity, immunological activity or other (presently 
unidentified) biological activities characteristic of the 
35 CAAX protease polypeptides of this invention. 

Immunological., activities include both immunogenic— function, 

> 
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in a target immune system, as well as sharing of 
immunological epitopes for binding, serving as either a 
competitor or substitute antigen for an epitope of the 

RCE1 or AFC1 protein. 

Expression of recombinant DNA molecules 
according to this invention may involve post-translational 
modification of a resultant polypeptide by the host cell. 
For example, in mammalian cells expression might include, 
among other things, glycosylation, lipidation or 
phosphorylation of a polypeptide, or cleavage of a signal 
sequence to produce a "mature" protein. Accordingly, the 
polypeptide expression products of this invention 
encompass full-length polypeptides and modifications or 
derivatives thereof, such as glycosylated versions of such 
15 polypeptides, mature proteins, polypeptides retaining a 
signal peptide, truncated polypeptides having comparable 
biological activity, and the like. 

As used herein, "epitope" refers to an antigenic 
determinant of a polypeptide, i.e., a region of a 
polypeptide that provokes an immunological response in a 
host. This region needs not comprise consecutive amino 
acids. The term epitope is also known in the art as 
"antigenic determinant". An epitope may comprise as few 
as three amino acids in a spatial conformation which is 
25 unique to the immune system of the host. Generally, an 
epitope consists of at least five such amino acids, and 
more usually consists of at least 8-10 such amino acids. 
Methods for determining the spatial conformation of such 
amino acids are known in the art. 

The term "antibody" refers to a polypeptide 
encoded by an immunoglobulin gene, genes, or fragments 
thereof. The immunoglobulin genes include the kappa, 
lambda, alpha, gamma, delta, epsilon and mu constant 
regions, as well as a myriad of immunoglobulin variable 
35 regions. Light chains are classified as either kappa or 

lambda. Heavy chains are classified as gamma, mu^alpha* 
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delta, or epsilon, which in turn define the immunoglobulin 
classes IgG, IgM, IgA, IgD and IgE, respectively. 

Antibodies exist for example, as intact 
immunoglobulins or as a number of well-characterized 
5 fragments produced by digestion with various peptidases. 
For example, trypsin digests an antibody below the 
disulfide linkages in the hinge region to produce F(ab)' 2 ' 
a dimer of Fab which itself is a light chain joined to a 
V H -C H 1 by a disulfide bond. The F(ab)' 2 *ay be reduced 
10 under mild conditions to break the disulfide linkage in 

the hinge region thereby converting the F(ab)' 2 dimer to a 
Fab' monomer. The Fab' monomer is essentially an Fab with 
part of the hinge region. See Fundamental, Immunology/ 
Third Edition, W.E. Paul, ed. Raven Press, N.Y. (1993) 
15 (incorporated herein by reference) for a detailed 

description of antibodies and antibody fragments. One of 
skill in the art recognizes that such Fab' fragments may 
be synthesized de novo either chemically or using 
recombinant DNA technology. Thus, as used herein, the 
20 term antibody includes antibody fragments produced by the 
modification of whole antibodies or those synthesized de 
novo. The antibodies of the present invention are 
optionally derived from libraries of recombinant 
antibodies in phage or similar vectors (see, e.g., Huse et 
25 al., Science , 246, pp. 1275-81 (1989); Ward et al., 

Nature , 341, pp. 544-46 (1989); Vaughan et al., Nature 
Biotech. , 14, pp. 309-14 (1996) which are incorporated 
herein by reference) . 

30 DETAILED DESCRIPTION OF THE INVENTION 

Identifying Human ESTs For Use as Probes 

We performed extensive database searches for DNA 
sequences capable of encoding protein sequence with 
significant sequence identity with the protein products of 
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the yeast RCE1, AFC1, and STE14 genes using the search 
engine XREF (Bassett et al., Trends in Genetics, 11/ pp. 
372-373 (1995) . Three expressed sequence tags (ESTs) were 
found with extensive amino acid sequence identity to the 
5 corresponding yeast genes (Figures 1A-C) . The human EST 
NCBI#604642 (Genbank Accession # W96411) showed 
considerable sequence identity with the yeast RCE1 gene 
product, Rcelp, over approximately 90 amino acids (Figure 
1A) . In addition, the EST possessed 3 conserved histidine 
10 residues in a tetrapeptide sequence we believe is critical 
for catalysis (HxHH at positions 194-197 of Rcelp; see 
Figure 1A, fragment 188-202). 

The human EST NCBI#482520 (Genbank Accession # 
N76181) showed 45 percent sequence identity with the 
15 product of the AFC1 gene (Afclp) over approximately 80 
amino acids (Figure IB) . The murine EST NCBI#636296 
(Genbank Accession # AA022288) sequence showed 36 percent 
identity with the product of the STE14 gene (Stel4p) over 
67 amino acids. 

20 Isolating human hRCE cDNA Homologs 

To isolate the human cDNA counterpart to the 
yeast RCE1 gene, we screened a Xgtll cDNA library made 
from a 9-week old human fetus. The library (ATCC #77433) 
was purchased from the American Type Culture Collection 
25 (see Example 1) . Briefly, phage were plated onto an JLu 
qqXL host strain, lifted onto nylon membranes and 
hybridized under stringent conditions to radio-labeled DNA 
sequences which were isolated from the human EST 
(ID#358628) (see above) . Two phage plaques, 13R and 27R, 
30 showed a strong hybridization signal to the human EST 

(#358628) probe and were purified to homogeneity by two 
successive rounds of plating and hybridization (Example 
1) . 
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Sequencing the human hRCE cDNA 

Phage stocks of the Xgtll clones 13R and 27R were 
prepared and titered, and DNA was prepared from these two 
clones (Example 1) . The Xgtll clone 27R contains a human 
5 cDNA insert of approximately 1.4 kB. An E. coli strain 
(LE392) infected with a purified phage stock of Xgtll 
clone 27R was deposited as strain designation >X 27R-Xgtll" 
on May 7, 1997 with the American Type Culture Collection 
(ATCC) (Rockville, MD) according to the provisions of the 
10 Budapest Treaty, and was assigned the ATCC accession 

number designated: ATCC #209020, All restrictions on the 
availability to the public of the above ATCC deposit will 
be irrevocably removed upon the granting of a patent on 
this application. 
15 The 5 ? and 3 1 ends of the X clones were 

amplified by PCR as described in Example 1. Additional 
sequence information for the human hRCE cDNA was obtained 
by 5' and 3' RACE methods (Siebert et al., NmclftiiC Acids 
Research , 23, pp. 1087-1088 (1995)) (see Example 1). 
20 The consensus sequence derived from the two 

human Xgtll cDNAs, 13R and 27R, together with the human 
RACE PCR products revealed the presence of an open reading 
frame extending from the 5' end of the sequence to 
nucleotide 996 (SEQ ID NO:l) (Figure 2A and see below). 
25 5' RACE reactions were also performed on mouse 

cDNA and a number of products were isolated and cloned 
into pCR2.1 (obtained from Invitrogen, Inc.). Two genomic 
mRCE clones, 20 (a 10.5 kbp BamHI fragment in pBluescript 
SK) and 75 (an 8.5 kbp EcoRI fragment in pbluescript SK) 
30 were sequenced using the primer RCE-15R. These mouse 
clone sequences were assembled into one continuous DNA 
sequence ( M a contig") with the aid of a number of murine 
ESTs (Genbank ID Nos: AA072190, AA154658, AA168614, 
W14344, AA154864, AA218396) . The continuous cDNA and 
35 genomic sequences are shown in Figures 2B and 2C. The 
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mouse RCE (mRCE) cDNA consensus also revealed the presence 
of a continuous ORF that extended from the 5' terminus to 
nucleotide 1004 (SEQ ID N0:3; SEQ ID NO:5) (see Figure 2B) . 
The first ATG codon encountered within the mouse ORF is 
5 underlined in Figure 2C. The predicted protein sequence 
of mRCE (SEQ ID NO: 4; SEQ ID NO: 6) is shown in Figure 2B. 

The first ATG codon encountered within the 
human hRCE ORF (SEQ ID NO:l) is well into the coding 
sequence and is located beyond a region of sequence 
10 identity with the yeast Rcelp sequence (see Figure 3) 

suggesting that translation begins at an upstream site. 
The 5 f terminal "G" residue of the hRCE cDNA sequence (SEQ 
ID NO:l) (see Figure 2A) , which was fused to linker 
sequences during the construction of the cDNA clone, may 
15 be preceded by an "AT" dinucleotide, forming an in-frame 
ATG codon for methionine. As described above, the 
corresponding G residue in the mouse mRCE sequence forms 
an ATG codon which was revealed upon obtaining additional 
genomic sequence information. If that ATG codon in mRCE 
20 is a functional translational start site, it is likely 

that the 5' G residue in the human hRCE sequence shown in 
Figure 2A is also preceded by an AT dinucleotide and the 
hRCE ORF begins with a methionine residue. 

Alternatively, translational initiation may 
25 begin at a non-ATG codon. There are two in-frame GTG 
codons at positions 116-118 and 122-124 (SEQ ID 
NO:l) which are known to function as translational start 
sites on eukaryotic mRNAs. Initiation at either of these 
codons would encode a protein of very similar size and 
30 composition as the yeast counterpart. 

Human hRCE RNA Analysis 

To resolve whether the DMA sequence we 
determined from the Agtll cDNA clones and numerous 5' RACE 
pr oducts reflected the bona fide 5' terminus of the^hRCE — 
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mRNA, we performed Northern blot analyses and primer 
extension experiments with human mRNA (Example 2) . 

The Northern blots revealed the presence of a 
single abundant mRNA species approximately 14 00 base-pairs 
5 in length. The expression pattern indicated that the hRCE 
gene is expressed in all tissues examined. The highest 
expression was found in placenta and pancreatic mRNA 
samples. The lowest expression was found in the brain. 
The size of the hRCE mRNA was consistent with 
10 transcription beginning at or within the sequence shown in 
Figure 2A. 

To map more precisely the actual 5 f ends of the 
hRCE mRNA, we performed primer extension experiments 
(Example 2) . The reaction products appeared to be the 
15 same when the reaction were performed under different 

conditions, and were the same from the two tissue types 
(Figure 5) . The transcriptional start sites appear to 
range from -10 to -26 relative to the valine codon GTG at 
position 116-118 (SEQ ID NO:l) (Figure 2A) . 

20 Expression of the human hRCE cDNA 

The most simple interpretation of the above 
analyses is that the start-site of translation of both the 
hRCE and mRCE genes is at the GTG codon located at 
position 116. To test whether the ORF beginning at 
25 position 116 encodes a functional counterpart to the yeast 
gene, we expressed this portion of the human ORF (SEQ ID 
N0:1 nucleotide 116 to 997, Figure 2A) in a yeast strain 
deleted for both CAAX protease genes, yAFCl and yRCEl 
(Boyartchuk et al., supra ) (Example 3). The human hRCE 
30 ORF was cloned into one of two expression vectors (pACAl 1 
or pACA5M under the control of the yeast MEV1 promoter 
and an ATG translational initiation codon, or the yeast 
ERG12 prom oter and the first four codons of the EBG12 



SUBSTITUTE SHEET (RULE 26) 



WO 98/54333 



- 26 - 



PCTAJS98/11415 



coding sequence, respectively. The resulting plasmids, 
P ACA7001 (also referred to as pAB14) and pACA7002 (also 
referred to as pAB13) , represent translational fusions 
between the MEV1 ORF and the ERG12 ORF, respectively. 
5 Both expression vectors contained the yeast PGK1 
transcriptional termination sequence. 

E. coli strains transformed with either pAB13 or 
pAB14 were deposited as strain designations w " and 
respectively, on June , 1998 with the American 
10 Type Culture Collection (ATCC) (Rockville, MD) according 
to the provisions of the Budapest Treaty, and were 
assigned the ATCC accession number designated: ATCC # 

and ATCC # • All restrictions on the 

availability to the public of the above ATCC deposits will 
15 be irrevocably removed upon the granting of a patent on 
this application. 

The yeast ^-mating factor is a CAAX-containing 
protein and must undergo C-terminal proteolysis in order 
to secrete biologically active pheromone. We tested 
20 whether the human hRCE expression constructs, P ACA7001 and 
pACA7002, were functional in yeast by their ability to 
restore production of a-factor in strain JRY5317 deleted 
for the two endogenous CAAX proteases. Plasmids pACA7001 
and pACA7002 were each transformed into the yeast strain 
25 JRY5317, an a.-mating type strain in which both CAAX 
protease genes have been deleted (Boyartchuk et al., 
snora ) (Example 3) . We performed a halo assay by replica 
plating patches of the yeast trans formants onto a lawn of 
a sst2 tester lawn. Cells harboring an sst2 mutation are 
30 rendered supersensitive to mating pheromone and undergo a 
Gl arrest when exposed to pheromone. 

Cells harboring either one of the hRCE 
expression plasmids, but not the vector control (vector 
with no hRCE insert) , produced significant amounts of 
35 biologically active a-factor (Figure 6) . This exp eriment 
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demonstrates that the human hRCE cDNA (Figure 2A) is the 
functional human counterpart to the yeast RCE1 gene and 
has a conserved substrate specificity extending to yeast 
a-f actor. This experiment also demonstrates that the ORF 
5 that begins at the GTG codon (SEQ ID N0:1, nucleotides 
116-118) and ends at the TGA stop codon (SEQ ID N0:1, 
nucleotides 995-997) encodes a functional human CAAX 
protease. 

Mammalian homologs of the yeast AFC1 gene 
10 Comparison of the yeast AFC2 gene with the NCBI 

EST database revealed the presence of several ESTs with 
significant sequence identity. The Genbank Accession 
numbers for the ESTs are: AA359058, H02497, AA210930, 
AA295448, Z43273, N76181, T35312, F11310, R54272, AA01886, 
15 AA134125, Z45538, H15202, and N30505. The sequences were 
aligned and found to contain a continuous ORF from the 5' 
end to nucleotide 721 (SEQ ID NO:37) (see Figure 7). The 
predicted amino acid sequence (SEQ ID NO: 38) shares 
extensive sequence identity with the yeast Afclp protein, 
20 including the proposed active site sequence HExxH 

(Boyartchuck et al., suera) which is characteristic of 
zinc metalloproteases (Figure 8) . Thus, this sequence 
likely encodes the functional counterpart in humans to the 
yeast CAAX protease, Afclp (Example 4). 

25 Protein Sequences Related to Human hRcelp 

To identify other known protein sequences 
related to those comprising human hRCElp, we performed a 
BLASTP search of GenBank with a fragment of the human Reel 
protein sequence including the histidine-rich sequence 
30 "HxHH" (SEQ ID N0:2, amino acids 172-175) . The search 
revealed sequence similarity with both yeast CAAX 
proteases, yRcelp and yAfclp (Figure 9) . The amino acid 
sequence of yeast Afclp has a zinc metal loprotease 
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consensus sequence (VxxHExxH) (SEQ ID NO: 50) located at 
positions 294-301. In the alignment shown in Figure 9, 
the last histidine of the zinc metalloprotease consensus 
sequence of yAfclp, (H301; histidine at position 301) 
aligns with the histidine at position 172 (H172) of 
hRcelp. The sequence similarity of the histidine-rich 
region of hRcelp to the region of yAfclp comprising the 
presumptive zinc-binding site which functions in catalysis 
suggests that these regions serve similar biochemical 
functions. These similar histidine-rich regions may be 
involved in substrate binding and/or catalysis. Given the 
presence of histidine-rich sequences that are functionally 
important in both proteins and the diverse utilization of 
histidine residues by protease enzymes in binding zinc, a 
role in metal binding and hence, catalysis, is reasonable. 

Afclp in all respects appears to be a zinc- 
dependent metalloprotease based both upon amino acid 
sequence analysis as well as enzymatic sensitivity to the 
zinc chelator, o-phenanthroline (Boyartchuk et al., 
supra ) . Reel has no similar sensitivity to o- 
phenanthroline nor to the chelator EDTA. Reports in the 
literature support the notion that the mammalian CAAX 
protease is either a serine or cysteine protease based 
upon its sensitivity to a farnesylated peptide aldehyde 
25 (Ma, Gilbert and Rando, pi ochftmistry, 32, pp. 2386-2393 
(1993)) and to the thiol modifying reagents PCMB and NEM 
(Hrycyna and Clarke, SL ^ - rhem -- 267 ' PP- 10457-464 
(1992); Ma, Chaudhuri and Rando, Pi orhftmistry* 31, pp. 
11772-777 (1992); Akopyan et al., BBB£/198 , pp. 787-794 
30 (1994)). 

Given the modest sensitivity of the mammalian 
CAAX endoprotease to thiol -modifying reagents and the 
similarity of hRcelp to Afclp (Figure 9) , it is tempting 
to speculate that Rcelp is related to the insulin 
35 degrading enzyme, IDE1. 



20 
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IDE1 is a zinc-dependent thiol protease that is 
a member of a family of such proteases that includes the 
EL. coli protease III (Affholter, J. A., Fried, V.A. , Roth, 
R.A., Science , 242, pp. 1415-1418 (1988)), and the yeast 
5 protease Axllp (Adames, N. , Blundell, K., Ashby, M.N., 
Boone, C, Science . 270, pp. 464-467 (1995)). However, 
the sequence of the yeast or human Reel protein do not 
show any significant sequence similarity to any members of 
this family. 

10 Perhaps the greatest clue as to the functional 

properties of Rcelp comes from the observations that Rcelp 
is inhibited by Zn 2+ (Ki-lOOmM) (Ashby, M.N., King, D.S., 
Rine, J., Proc. Natl. AC3ti, ScA. U.S.A., 89, pp. 4613-4617 
(1992)), that the conserved histidine residues of hRcepl 
15 (H172, H174, H175) are functionally important (see below), 
and that the same histidine-rich region shows similarity 
to Afclp around its putative zinc-binding site described 
herein. These latter observations strongly suggest a role 
of metal-binding in the function of hRcelp. Zinc is the 
20 most likely metal ion to associate with hRcelp, although 
another divalent cation may be involved, such as cobalt, 
calcium or magnesium. 

The notion that hRcelp requires zinc-binding, or 
another divalent cation, will provide valuable information 
25 in the design of potent inhibitors. For instance, 

hydroxamic acid derivatives have found significant utility 
for the inhibition of zinc proteases such as the matrix 
metalloproteinase (Levy, D.E. et al., J. Med, Chem,, 41, 
pp. 199-223 (1998)) and angiotensin-converting enzyme 
30 (Parvathy, S. et al., Blochem. J., 327, pp. 37-43 (1997)). 
Hydroxamic acid confers it inhibitory effect on zinc 
metalloproteases through its metal chelating properties. 
Possible hydroxamic acid hRcelp inhibitors would thus 
include a prenylated cysteine amino acid linked to a 
35 hydroxamate group at it carboxy terminus. The amino group 
of the inhibitor would be preferably linked to a small 
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aliphatic group such as methyl, or more preferably to 
acetyl, or linked to a small number of amino acids, or 
left unmodified. The prenyl group would be linked to the 
cysteine residue through the sulfur group of the cysteine 
and would be composed of a trans, trans, trans- 
geranylgeranyl lipid, or more preferably, a trans, trans- 
farnesyl lipid. 

Mutagenesis of the Histidine-Rich Region of hRCEl 

To determine whether the histidine-rich sequence 
"HxHH" of the human Reel protein (SEQ ID NO:2, amino acids 
H2-175) are required for biological activity, we changed 
each of these histidine residues individually to an 
alanine residue (H172A, H174A and H175A) by site-directed 
mu tagenesis (Example 7) . Each mutant HRCEl gene was 
inserted into a high-copy expression vector and tested in 
parallel with a wild-type hRCEl expression construct for 
function in yeast by their ability to restore production 
of ^-factor as determined by halo assays in strain 
JRY5317 (deleted for the two endogenous CAAX proteases) 
(see Figure 6 and accompanying text, above, and Example 

3) * as shown in Figure 10, cells harboring the wild- 

type hRCE expression plasmid produced significant amounts 
of biologically active a -f actor producing a visible-halo 
> 5 (PAB13; also referred to as P ACA7002 in Figure 6) . In 

contrast, cells harboring mutant hRCE expression plasmids 
in which the histidines at position 172, 174 or 175 were 
individually replaced by alanine produced significantly 
less biologically active a -f actor, as evidenced by the 
30 reduced ( P AB16; H174A and P AB17; H175A) or undetectable 
(pABIS; H172A) halos (Figure 10) . This experiment 
demonstrates that histidine residues H172, H174 and H175, 
comprising the histidine-rich region of human hRCEl 
(conserved between RCE1 and AFC1) contribute to biological 

SUBSTITUTE SHEET (RULE 26) 



WO 98/54333 



- 31 - 



PCT/US98/11415 



activity of hRCElp protein in vivo. 

The ability for the above wild-type and mutant 

hRCEl plasmids to express biologically active CAAX 

protease in vivo was also measured by heat shock assays in 

5 yeast cells transformed with a plasmid expressing the 

19 

mutant RAS2-val gene (dominant RAS2 allele) (see Figure 

11 and Example 9) . 

As described above, a deletion in the RCE1 gene 

suppresses the temperature-sensitive phenotype of cells 

19 

10 expressing activated Ras2p-val (i.e., Rcelp activity is 

required for the functional expression of Ras2p) . Heat- 

19 

shock assays in cells expressing activated Ras2p-val may 

thus be used to assess Rcelp biological activity in vivo 

(see Example 9) . As shown in Figure 11, cells that 

19 

15 express activated Ras2p-val in the presence of a 

functional RCE1 gene (WT; wild- type yRCEl and pAB13; 
hRCEl) are temperature-sensitive after heat shock and have 
reduced assay or with a wild-type Percent (%) viability 
after heat shock treatment is shown for each strain . 

20 Vectors, Cloning and Nucleic Acids 

The present invention provides nucleic acids and 
recombinant DNA vectors which comprise mammalian RCE and 
AFC DNA sequences. More specifically, vectors comprising 
all or portions of the cDNA sequences of the human and 

25 mouse RCE and AFC functional homologs described above are 
provided. The vectors of this invention also include 
those comprising DNA sequences which hybridize under 
stringent conditions to the human and mouse RCE and AFC 
cDNA sequences, and conservatively modified variations 

30 thereof. 

The nucleic acids of this invention include 
single stranded and double-stranded genomic DNA, cDNA, 
RNA, or hybrids thereof and may be isolated from 
biological sources or synthesized chemically or by 
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recombinant DNA methodology. The nucleic acids, 
recombinant DNA molecules and vectors of this invention 
m ay be present in transformed or transfected cells, cell 
lysates, or in partially purified or substantially pure 
5 forms. 

As is well known in the art, DNA sequences may 
be expressed by operatively linking them to an expression 
control sequence in an appropriate expression vector and 
employing that expression vector to transform an 
10 appropriate unicellular host. Such operative linking of a 
DNA sequence of this invention to an expression control 
sequence, of course, includes, if not already part of the 
DNA sequence, the provision of a translation initiation 
codon, ATG, in the correct reading frame upstream of the 

15 DNA sequence. 

A wide variety of host/expression vector 

combinations may be employed in expressing the DNA 
sequences of this invention. Useful expression vectors, 
for example, may consist of segments of chromosomal, non- 
20 chromosomal and synthetic DNA sequences. 

DNA vector design for trans feet ion into 
mammalian cells should include appropriate sequences to 
promote expression of the gene of interest, including: 
appropriate transcription initiation, termination and 
25 enhancer sequences; efficient RNA processing signals such 
as splicing and polyadenylation signals; sequences that 
stabilize cytoplasmic mRNA; sequences that enhance 
translation efficiency (i.e., Kozak consensus sequence); 
sequences that enhance protein stability; and when 
30 desired, sequences that enhance protein secretion. A 
great number of expression control sequences — 
constitutive, inducible and/or tissue-specific - are 
known in the art and may be utilized. 

Preferred DNA vectors also include a marker gene 
35 and means for amplifying the copy number of the gene of 
interest. DNA vectors may also comprise stabilizing 
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sequences (e.g., ori- or ARS-like sequences and telomere- 
like sequences), or may alternatively be designed to favor 
directed or non-directed integration into the host cell 
genome . 

5 Substantial progress in the development of 

mammalian cell expression systems has been made in the 
last decade and many aspects of the system are well 
characterized. A detailed review of the production of 
foreign proteins in mammalian cells, including useful 
10 cells, protein expression-promoting sequences, marker 

genes, and gene amplification methods, is disclosed in M. 
M. Bendig, Genetic Engineering, 7, pp. 91-127 (1988). 

Particular details of the transf ection, 
expression and purification of recombinant proteins are 
15 well documented and are understood by those of skill in 

the art. Further details on the various technical aspects 
of each of the steps used in recombinant production of 
foreign genes in mammalian cell expression systems can be 
found in a number of texts and laboratory manuals in the 
20 art. See, e.g., F. M. Ausubel et al . , ed., Current 

Protocols in Molecular Biology, John Wiley & Sons, New 
York (1989) . 

Useful expression vectors for bacterial hosts 
include bacterial plasmids, such as those from E. coli, 
25 including pBluescript, pGEX-2T, pUC vectors, col El, pCRl, 
pBR322, pMB9 and their derivatives, wider host range 
plasmids, such as RP4, phage DNAs, e.g., the numerous 
derivatives of phage lambda, e.g., NM989, XGT10 and XGT11, 
and other phages, e.g., M13 and Fi lament eous single 
30 stranded phage DNA. Useful expression vectors for yeast 
cells include the 2]i plasmid and derivatives thereof. 
Useful vectors for insect cells include pVL 941. 

In addition, any of a wide variety of expression 
control sequences — sequences that control the expression 
35 of a DNA sequence when operatively linked to it — may be 
used in these vectors to express the DNA sequences of this 
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invention. Such useful expression control sequences 
include the expression control sequences associated with 
structural genes of the foregoing expression vectors. 
Examples of useful expression control sequences include, 
5 for example, the early and late promoters of SV40 or 

adenovirus, the lac system, the £n> system, the £&£ or 1EQ 
system, the T3 and T7 promoters, the major operator and 
promoter regions of phage lambda, the control regions of 
fd coat protein, the promoter for 3-phosphoglycerate 
10 kinase or other glycolytic enzymes, the promoters of acid 
phosphatase, e.g., Pho5, the promoters of the yeast fi- 
mating system, the GAL1 or GAL10 promoters, and other 
constitutive and inducible promoter sequences known to 
control the expression of genes of prokaryotic or 
15 eukaryotic cells or their viruses, and various 

combinations thereof. See, e.g., Th» MalgCUlax PioloqY of 
rtr v, nir ^h a m.vr M (eds. Strathern, Jones and Broach) 
Cold Spring Harbor Lab., Cold Spring Harbor, N.Y. for 
details on yeast molecular biology in general and on yeast 
20 expression systems (pp. 181-209) (incorporated herein by 

reference) ) . 

For eukaryotic cells, expression control 
sequences typically include a promoter, an enhancer 
derived from immunoglobulin genes, SV40, cytomegalovirus, 
25 etc., and a polyadenylation sequence which may include 
splice donor and acceptor sites. 

In a preferred embodiment, DNA sequences of this 
invention are cloned in the expression vector lambda ZAP 
II (Stratagene, La Jolla, CA) , in which expression from 
30 the lac promoter may be induced by IPTG. 

In another preferred embodiment, DNA sequences 
of this invention are inserted in frame into an expression 
vector that allows high level expression of the 
corresponding polypeptide as a fusion protein. Examples 
35 of fusion proteins envisioned by this invention include 
but are not limited to fusions to reporter genes, such as 
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the green fluorescent protein, lacZ and luciferase, and 
fusions to epitope tags such as myc, hemagglutinin (HA) or 
other related short immunogenic sequences. The DNA 
sequences of this invention may be linked by the acid 
5 labile cleavage site (Asp-Pro) to a leader sequence (such 
as the "MLE leader") suitable for promoting expression in 
E. coli. Full-length and truncated forms of the DNA 
sequences of this invention may be prepared as fusion 
constructs. 

10 A wide variety of unicellular host cells are 

useful in expressing the DNA sequences of this invention. 
These hosts may include well known eukaryotic and 
prokaryotic hosts, such as strains of £. coli, 
Pseudomonas, Bacillus, Streptomyces, fungi, yeast, insect 
15 cells such as Spodoptera frugiperda (SF9) , animal cells 
such as CHO and mouse cells, African green monkey cells 
such as COS 1, COS 7, BSC 1, BSC 40, and BMT 10, and human 
cells, as well as plant cells in tissue culture. 

It should of course be understood that not all 
20 vectors and expression control sequences will function 
equally well to express the DNA sequences of this 
invention. Neither will all hosts function equally well 
with the same expression system. However, one of skill in 
the art may make a selection among these vectors, 
25 expression control sequences and hosts without undue 

experimentation and without departing from the scope of 
this invention. For example, in selecting a vector, the 
host must be considered because the vector must be 
replicated in it. The vector's copy number, the ability 
30 to control that copy number, the ability to control 
integration, if any, and the expression of any other 
proteins encoded by the vector, such as antibiotic or 
other selection markers, should also be considered. 

In selecting an expression control sequence, a 
35 variety of factors should also be considered. These 
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include, for example, the relative strength of the 
sequence, its controllability, and its compatibility with 
the DNA sequence of this invention, particularly with 
regard to potential secondary structures. Unicellular 

5 hosts should be selected by consideration of their 

compatibility with the chosen vector, the toxicity of the 
product coded for by the DNA sequences of this invention, 
their secretion characteristics, their ability to fold the 
polypeptide correctly, their fermentation or culture 

10 requirements, and the ease of purification from them of 
the products coded for by the DNA sequences of this 
invention . 

Within these parameters, one of skill in the art 
may select various vector/expression control sequence/host 
15 combinations that will express the DNA sequences of this 
invention on fermentation or in other large scale 
cultures . 

Given the strategies described herein, one of 
skill in the art can construct a variety of vectors and 
20 nucleic acid molecules comprising functionally equivalent 
nucleic acids. DNA cloning and sequencing methods are 
well known to those of skill in the art and are described 
in an assortment of laboratory manuals, including Sambrook 
et al, sirora , 1989; and Current Protocols , in Nol ftCWlar 
25 Bifllcsy., F.M. Ausubel et al., eds., Current Protocols, a 
joint venture between Greene Publishing Associates, Inc. 
and John Wiley & Sons, Inc. (1994 Supplement). Product 
information from manufacturers. of biological, chemical and 
immunological reagents also provide useful information. 

30 Recombinant Mammalian RCE and AFC Polypeptides and 
Antibodies Directed Against Them. 

The recombinant DNA molecules and more 
particularly, the expression vectors of this invention may 
be used to express recombinant RCE and AFC polypeptides in 
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a heterologous host cell. The RCE and AFC polypeptides of 
this invention may be full-length or less than full-length 
recombinant polypeptide fragments recombinantly expressed 
from the DNA sequences according to this invention. Such 
5 polypeptides include variants, fragments and muteins 
having biological activity. The polypeptides of this 
invention may be soluble, or may be engineered to be 
membrane- or substrate-bound using techniques (such as 
cross-linking) well known in the art. The polypeptides of 
10 this invention may be fused to other molecules, such as 
genetic/ enzymatic or chemical or immunological markers 
such as epitope tags. Fusion partners include, inter 
alia, GST, immunoglobulins, bacterial beta-galactosidase, 
trpE, protein A, p-lactamase, a amylase, maltose binding 
15 protein, alcohol dehydrogenase, polyhistidine (for 
example, six histidine at the amino and/or carboxyl 
terminus of the polypeptide), green fluorescent protein 
(GFP) , yeast a mating factor, GAL4 transcription 
activation or DNA binding domain, and lucif erase. See, 
20 e.g., Godowski et al., 1988. 

Fusion proteins will typically be made by either 
recombinant nucleic acid methods, as described above, or 
may be chemically synthesized. Techniques for the 
synthesis of polypeptides are described, for example, in 
25 Merrifield, 1963. 

Transformation and other methods of introducing 
nucleic acids into a host cell (e.g., transfection and 
viral infection) can be accomplished by a variety of 
methods which are well known in the art. Animal (and 
30 yeast) cells may be transfected with a plasmid, a cosmid 
or the like, infected with a viral vector of this 
invention for transient or stable expression of RCE or AFC 
polypeptides. For example, for transient expression in 
animal cells, investigators have used spheroplast fusion, 
35 DEAE dextran, and electroporation. For stable expression 
they have used calcium phosphate, spheroplast fusian,_^and 
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electroporation. 

The polypeptide gene products of the mammalian 
RCE or AFC genes of this invention may be used to produce 
polyclonal or monoclonal antibodies using a variety of 
techniques well known to those of skill in the art. 
Alternatively, peptides corresponding to specific regions 
of the RCE or AFC polypeptides may be synthesized and used 
to create immunological reagents according to well known 
methods . 

Antibodies directed against the mammalian RCE 
and AFC polypeptides of this invention are immunoglobulin 
molecules or portions thereof that are immunologically 
reactive with at least one novel mammalian RCE or AFC 
polypeptide of the present invention. It should be 
understood that the antibodies of this invention include 
antibodies immunologically reactive with fusion proteins 
and multimeric proteins comprising a novel mammalian RCE 

or AFC polypeptide. 

Antibodies directed against a novel mammalian 
RCE or AFC polypeptide may be generated by immunization of 
a mammalian host. Such antibodies may be polyclonal or 
monoclonal. Preferably they are monoclonal. Methods to 
produce polyclonal and monoclonal antibodies are well 
known to those of skill in the art. For a review of such 
methods, see Antibodies, A Laboratory Manual, svpra, and 
D.E. Yelton, et al. f firni F*v . of fiior.hem. , 50, pp. 657-80 
(1981) . Determination of immunoreactivity with a novel 
mammalian RCE or AFC polypeptide of this invention may be 
made by any of several methods well known in the art, 
30 including by immunoblot assay and ELISA. _ g 
Monoclonal antibodies with affinities of 10 
M" 1 or preferably 10" 9 to lO -10 M _1 or stronger are 
typically made by standard procedures as described, e.g., 
in Harlow and Lane, 1988 or Coding, 1986. Briefly, 
35 appropriate animals are selected and the desired 
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immunization protocol followed. After the appropriate 
period of time, the spleens of such animals are excised 
and individual spleen cells fused, typically, to 
immortalized myeloma cells under appropriate selection 
5 conditions. Thereafter, the cells are clonally separated 
and the supernatants of each clone tested for their 
production of an appropriate antibody specific for the 
desired region of the antigen. 

Other suitable techniques involve in vitro 
10 exposure of lymphocytes to the antigenic polypeptides, or 
alternatively, to selection of libraries of antibodies in 
phage or similar vectors. See Huse et al., 1989. The 
polypeptides and antibodies of the present invention may 
be used with or without modification. Frequently, 
15 polypeptides and antibodies will be labeled by joining, 
either covalently or non-covalently, a substance which 
provides for a detectable signal. A wide variety of 
labels and conjugation techniques are known and are 
reported extensively in both the scientific and patent 
20 literature. Suitable labels include radionuclides, 

enzymes, substrates, cof actors, inhibitors, fluorescent 
agents, chemiluminescent agents, magnetic particles and 
the like. Patents teaching the use of such labels include 
U.S. Patents 3,817,837; 3,850,752; 3,939,350; 3,996,345; 
25 4,277,437; 4,275,149 and 4,366,241. Also, recombinant 
immunoglobulins may be produced (see U.S. Patent 
4,816,567). 

An antibody of this invention may also be a 
hybrid molecule formed from immunoglobulin sequences from 

30 different species (e.g., mouse and human) or from portions 
of immunoglobulin light and heavy chain sequences from the 
same species. It may be a molecule that has multiple 
binding specificities, such as a bifunctional antibody 
prepared by any one of a number of techniques known to 

35 those of skill in the art including: the production of 
hybrid hybridomas; disulfide exchange; chemical cross- 
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linking; addition of peptide linkers between two 
monoclonal antibodies; the introduction of two sets of 
immunoglobulin heavy and light chains into a particular 

cell line; and so forth. 

The antibodies of this invention may also be 
human monoclonal antibodies, for example those produced by 
immortalized human cells, by SCID-hu mice or other non- 
human animals capable of producing "human" antibodies, or 
by the expression of cloned human immunoglobulin genes. 
0 in sum, one of skill in the art, provided with 

the teachings of this invention, has available a variety 
of methods which may be used to alter the biological 
properties of the antibodies of this invention including 
methods which would increase or decrease the stability or 
5 half-life, immunogenics, toxicity, affinity or yield of 
a given antibody molecule, or to alter it in any other way 
that may render it more suitable for a particular 
application. 

Assays to Identify Inhibitors of Mammalian CAAX-Processing 
20 Enzymes 

This invention is directed to methods for 
identifying inhibitors of mammalian CAAX processing 
enzymes by la vitro enzyme inhibition assays using the 
polypeptide expression products of the mammalian RCE and 
AFC sequences. In particular, this invention provides a 
heterologous host cell or organism comprising a 
recombinant DNA molecule (vector) capable of expressing a 
mammalian gene encoding Rcelp or Afclp. A host cell that 
overexpresses Rcelp or Afclp is also provided. One 
preferred host organism is a yeast such as Saccharomyces 
transformed with a high copy plasmid encoding a RCE or AFC 
gene. The resulting recombinant organism represents an 
enriched source for the corresponding mammalian enzyme, 
inclusion of such an extract in an i n vitro CAAX-protease^ 
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assay allows the identification of inhibitors that can 
reduce or eliminate the mammalian CAAX protease activity. 

One method for isolating a protein inhibitor (or 
an activator) of a mammalian CAAX protease is the "two- 
5 hybrid system" (see, e.g., C.T. Chien et al., "The 

two-hybrid system: a method to identify and clone genes 
for proteins that interact with a protein of interest", 
Prnr. Natl. Ar.ri. Sri. U.S.A., 88(21), pp. 9578-82 (1991)) 
(incorporated herein by reference) (Example 6) . 

10 This invention is also directed to methods for 

identifying inhibitors of mammalian CAAX processing 
enzymes by screening host cells transformed with the DNA 
sequences of this invention in in vivo biological activity 
assays, including but not limited to those described 

15 herein. 

One preferred in vivo method to identify 
inhibitors of Rcelp or Afclp first requires identifying 
genes whose expression requires the presence of the 
appropriate protease. A gene expression reporter 
20 construct may be created and introduced into cells. Such 
a cell line or strain will allow indirect monitoring of 
protease function. Incubation of these cells with select 
candidate protease inhibitors or a combination of 
candidate inhibitors such as a chemical library 
25 facilitates identification of chemicals that alter 
protease activity in vivo. 

Another preferred in vivo method to identify 
inhibitors of Rcelp or Afclp capitalizes on the extreme 
sensitivity of co-culture experiments to identify 
30 differences in the relative growth rates (or fitness) of 
organisms. Using this approach, two strains of cells 
which may possess only a subtle difference are mixed and 
grown for a number of generations. At various points of 
the growth period, portions of the co-culture are analyzed 
35 to determine the relative proportion of each constituent 
strain.- Very small differences in growth rates between 
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two strains will result in the more fit strain becoming 
the predominant, if not the exclusive, strain in the co- 
culture after a number of generations. Co-cultures may be 
grown and the cells diluted into fresh medium to 
continually increase the number of cell generations in 
culture, enabling even very subtle differences in growth 
rates to be measured. 

Such co-culture experiments may be used to 
identify compounds that are competitive inhibitors of 
Rcelp or Afclp (Example 8) . Competitive inhibition of an 
enzyme can be overcome by increasing the amount of enzyme. 
Therefore, a strain that harbors a significant amount of 
Rcelp (or Afclp) will be less sensitive to the effects of 
a competitive inhibitor than will be a strain with a 
15 normal amount of Rcelp (or Afclp) . 

In one embodiment, a yeast strain is constructed 
(Strain 1) that harbors a high-copy (2u) plasmid that 
encodes the RCE1 gene. Preferably, the RCE1 gene is 
driven by a very strong promoter, such as that of the ADHl 
gene. This RCE1 overexpressing strain also possesses, 
preferably (but not necessarily) on the same high-copy 
plasmid, a marker gene the presence of which (i.e., the 
gene or gene product) can be easily detected, such as the 
green fluorescent protein (GFP) . A second strain (Strain 
25 2) is required for the co-culture experiment and is 

preferably Strain 1 without the high-copy plasmid carrying 
the RCE1 gene (and optionally comprising a marker gene). 

Strain 1 and Strain 2 are grown independently to 
mid-log phase and then mixed at a selected ratio to create 
an appropriate co-culture. Preferably, the mixture will 
contain a large excess of Strain 2 relative to Strain 1 
(e.g., 5,000:1; see Example 8) . Because Strain 1 (which 
harbors the multi-copy plasmid encoding RCE1 and GFP) is 
present in such low abundance, the overall mixture will 
35 not exhibit significant fluorescence. During growth of 
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the co-culture, if a situation were encountered where 
Strain 1 has a growth advantage over Strain 2, the 
relative abundance of Strain 1 would increase and thereby 
cause an easily identified increase in fluorescence of the 
5 mixture. 

The co-cultures are aliquoted into separate 
growing chambers (e.g., the wells of 96-well plates). To 
identify inhibitors of Rcelp, a selected concentration or 
range of concentrations from selected putative inhibitors 
10 (e.g., portions of a chemical library) are added to 

individual wells of 96-well plates which contain aliquots 
of the co-culture. Both of these steps lend themselves 
well to automation and large numbers of chemicals may be 
tested in parallel. 
15 The co-cultures containing added test chemicals 

are incubated until the cells have divided numerous times. 
At various points during the growth period, the relative 
abundance of Strain 1 in the mixture may be assessed by 
fluorescence emission from the 96-well plates. 
20 The cells are diluted into fresh medium and 

allowed to continue to grow. This regimen of growth 
followed by dilution into fresh medium may be repeated 
until the cells have experienced multiple generation 
doublings (e.g./ 2-20). Preferably, the cells undergo 10 
25 generation doublings. More preferably, the cells undergo 
5 generation doublings. The preferred number of 
generation doublings of the co-culture may be determined 
as described in Example 8. Preferably, the concentrations 
and ratios of Strains 1 and 2 in the beginning co-culture 
30 are selected to avoid the need to dilute the cultures, 
which involves more work and time. 

In a more preferred embodiment of the above 
assay, Strain 1 and Strain 2 do not express (e.g., are 
deleted for) the endogenous RAS1 gene and possess the 
35 ras2-23 allele in place of RAS2. The ras2-23 allele, for 
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example, is temperature sensitive and does not adequately 
function at 37°C to confer viability to a rasl::HIS3 .train 
CH Mitsuzawa et al., Sortie* 123, pp. 739-748 (1989)). 
At a semi-permissive temperature (34<>C), growth of this 
5 strain depends upon the function of Rcelp (V.L. Boyartchuk 
J et al., ttltDC* 275, pp. 1796-1800 (1997,,. Therefore, 
the presence of the ras2-23 allele, in the absence of 
BMi, sensitizes Strain 1 and Strain 2 such that their 
growth rates depend upon the function of Rcelp. 



10 



15 



20 



25 



30 



Utility of Mammalian RCE and AFC Nucleic Acid Sequences, 
Polypeptides, and Antibodies And Inhibitors Thereof 

The sequences of hRCE (SEQ ID N0:1, , mRCE (SEQ 
ID NO:3 or SEQ ID NO:5) and hAFC (SEQ ID NO:37, - in 
their entirety or fragments thereof - may be used xn 
hybridization experiments to: A) determine RCE expression 
ievels in various tissues, different developmental or 
disease states or other conditions; B) isolate and clone 
related genes, mRNA or cDNAs which hybridize to those 
sequences; and C, isolate related or identical mRNA 
sequences for the purpose of in_vit^ translation. The 
sequences comprising hRCE, mRCE or hAFC may also be used 
to construct fusion proteins in order to address the 
localization of each of these proteins in_viv*. Such 
fusion proteins may additionally contain a reporter gene, 
such as the green fluorescent protein, lacZ or lucif erase. 

Alternatively, such sequences may be fused to an 
epitope tag, such as myc, hemagglutinin (HA, or any other 
related short sequences. The above DNA sequence 
manipulations may be accomplished using methods which are 
well known to those of skill in the art. 

RCE and AFC sequences may be used to construct 
vectors that express protein products having biological 
activity (e.g., CAAX processing, in a heterologous 
organism, preferably, in aunicellular organism. 
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Expression products may be purified by conventional means 
which are well known to those of skill in the art. 
Mammalian RCE and AFC DNA sequences or their expression 
products may be purified and then immobilized to beads or 
5 solid support matrices which can then be used to isolate 
and to identify cellular factors which bind to those DNA 
or amino acid sequences. 

The hRCE, mRCE or hAFC sequences (in their 
entirety or fragments thereof) may be expressed in a 
10 heterologous organism to: 

A) examine many biochemical features of the protein 
such as catalytic mechanism, Kcat, Km, Vmax, and 
oligomeric status; 

B) determine the substrate specificities of the CAAX 
15 processing enzyme such as the tolerance of non-aliphatic 

residues at the Al and A2 position, and permissive and 
non-permissive substitutions at the X position, lipid 
specificities attached to the cysteine and amino acid 
sequence specificities N-terminal of the cysteine; 
20 C) create an in vitro assay for the purpose of 

identifying and characterizing inhibitors of the protease; 

D) create antibodies (either polyclonal or 
monoclonal), and hybrid, chimeric and humanized 
antibodies, for the purpose of localization of protein or 

25 to selectively inactivate protease activity; 

E) assess the functional importance of Ras CAAX 
processing in yeast. Yeast which express an activated 
allele of Ras display a heat shock sensitivity when 
exposed to an acute treatment at 55°C. The endogenous 

30 protease gene products may be inactivated (e.g., by 

mutation of the AFC1 and RCE1 genes) by standard methods, 
which causes a suppression of the heat-shock sensitivity. 
Normal, mutant and altered forms of the hRCE or mRCE cDNAs 
may be expressed in yeast cells lacking endogenous AFC1 

35 and RCE1 activities (e.g., the deletion strains described 
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herein) and the yeast cells tested for relative 
sensitivity to heat-shock. 

In addition, hRCE, mRCE or hAFC sequences may be 
used to create an antisense vector. The antisense vector 
may be used to synthesize an RCE or AFC antisense molecule 
to specifically inactivate gene expression and thereby to 
inactivate the production of the RCEp or AFCp proteins. 
Such vectors could be constructed such that it is operably 
linked and under the control of an inducible, tissue 
specific, developmental or constitutive promoter. Thus, 
RCE or AFC activity could be selectively turned on or off 
at desired iocations that would therapeutically effect a 
CAAX processing-mediated or affected diseased state or 
disorder in a patient. 
15 Similarly, the hRCE, mRCE or hAFC sequences of 

this invention may be used to construct specific ribozymes 
that are directed to the endogenous RCE or hAFC mRNAs. 
Methods for constructing various types of ribozymes are 
published and may be performed by those of skill in the 
art using the RCE or hAFC sequences described herein. 

The mouse mRCE cDNA sequences of this invention 
may be used to construct a genetic knockout in a mouse. 
The genetically altered mouse may in turn be used as a 
mammalian model for human diseases and disorders involving 
25 proteins processed by the RCE gene product. Such 
genetically altered mice will thus be useful in 
determining the functional importance of mRCE in various 
mammalian disease states and disorders, and to assess the 
therapeutic value of inhibitors of mRCE or other 
therapeutic agents on the diseased state in mouse. An 
example of one such diseased state is the tumorigenesis 

induced in mice by expressing activated forms of Ras 

vall2 v vall2 
protein (H-ras or K-ras J . 

The antibodies of this invention have a variety 
of uses. For example, they are useful as reagents to 



20 



30 



35 



SUBSTITUTE SHEET (RULE 26) 



WD 98/54333 



- 47 - 



PCT/US98/11415 



screen for expression of mammalian RCE or AFC 
polypeptides, either in expression libraries constructed 
from mammalian DNA sources or from other samples in which 
the proteins may be present. Moreover, by virtue of their 
5 specific binding affinities, the antibodies of this 

invention are also useful to purify or remove polypeptides 
from a given sample, and to block or bind to specific 
epitopes on the polypeptides. 

Such immunological reagents may be used for a 
10 variety of applications, some examples of which are below. 

1. Determine protein levels of Rcelp and Afclp under 
different conditions in a particular tissue. The assay 
may be performed by an immobilization-based protocol such 
as an immunoblot where the proteins are resolved by 

15 SDS-PAGE and transferred to a solid support or by an ELISA 
assay. Such an approach would reveal steady state levels 
of the proteins. The synthesis rate and protein stability 
of the respective proteins could be determined by a 
pulse-chase experiment comprising an immunoprecipitation 

20 step using an antibody of this invention. 

2. Determine the apparent molecular weights of Rcelp 
and Afclp. An immunoblot of protein extracts that have 
been resolved by SDS-PAGE may be used to identify the size 
of the proteins when they are expressed in a particular 

25 cell type or tissue source or when they are expressed 
recombinantly in, for example,. a unicellular host. 
Deviations from the molecular weight predicted from the 
amino acid sequence may 

reveal polypeptide products from alternatively spliced 
30 mRNAs or post-translational processing events such as 

proteolysis, glycosylation, lipidation or phosphorylation. 

3. Subcellular localization of RCE and AFC 
polypeptides. Tissue~or cell extracts may be fractionated 
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into cellular components, the proteins resolved by 
electrophoresis and detected using the immunological 
reagents of this invention. Association of the protein 
with a specific cellular organelle may reveal the sub- 

5 cellular localization of the protease. Alternatively,, 
antibodies directed against Rcelp or Afclp could be 
labeled with a fluorescent tag such as fluorescein and 
incubated with permeabilized cells in culture. 
Fluorescent microscopy could then be employed to identify 

10 the localization of the protein in vivo . 

4. Determining the functional requirement of Ras 
C-terminal proteolysis. Micro- inject ion of activated 
forms of Ras into NIH-3T3 cells or Xenopus oocytes results 
in a rapid and quantifiable phenotype for assaying Ras 
15 function (Schafer et al., Science/ 245, pp. 379-385 

(1989)). Co-injecting an Rcelp inactivating antibody with 
an unprocessed form of activated Ras would allow the 
determination of the functional requirement of Ras 
proteolytic processing for Ras biological function. 

5. Identifying the topology of Rcelp or Afclp within 
a lipid bilayer. Monoclonal antibodies directed against 
discrete regions of either polypeptide could be incubated 
with permeabilized cells or membrane fractions to map 
regions of the respective proteins that are cytosol-facing 
25 or cytosol-excluded. Cytosol-facing domains would be 
accessible for interaction with epitope-specif ic 
antibodies and could be identified by fluorescent 
microscopy or immunoblotting techniques. 



20 



30 



6. Confirming protein-protein interactions. 
Candidate Rcelp- or Af clp-interacting proteins identified 
genetically or by an assay such as the "two-hybrid screen" 
could be tested for their ability to interact in vitro . 
-eel-ls-ectopically expressing the- candidate protein and-the 
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respective protease could be cross-linked and 
immunoprecipitated with protein- specific antisera. Once 
precipitated, the complexes could be resolved by SDS-PAGE 
and immunoblotted with antisera 
5 directed against the second member of the putative 
complexes . 

All of the above utilities apply to the 
corresponding use of the AFC sequences, polypeptides and 
inhibitors and antibodies of those polypeptides according 
10 to this invention whether or not specifically referred to 
above. They also apply to the corresponding use of 
related sequences which may be identified, for example, by 
hybridization to the sequences of this invention, and 
their corresponding expression products. 
15 This invention is also directed to methods for 

identifying inhibitors of mammalian CAAX processing 
enzymes by screening host cells transformed with the DNA 
sequences of this invention in in vivo biological activity 
assays such as those described herein, or by in Vitro 
20 enzyme inhibition assays using the polypeptide expression 
products of the mammalian RCE and AFC sequences. 
Inhibitors of the mammalian CAAX processing enzymes will 
be useful as therapeutics for treating certain CAAX 
protein-mediated diseased states or disorders in a 
25 patient, for example, for treating cancers or tumors 

correlated to Ras activation. In the therapeutic method 
of this invention, a small molecule inhibitor of a 
mammalian Rcelp or Afclp protein is administered to a 
patient suffering from a CAAX protein-mediated disease or 
30 disorder. 

The inhibitors of mammalian CAAX processing 
enzymes of this invention may be formulated into 
pharmaceutical compositions and administered in vjVQ at an 
effective dose to treat the particular clinical condition 
35 addressed. Determination of a preferred pharmaceutical 
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formulation and a therapeutically efficient dose regent 
for a given application is within the skill of the art 
taking into consideration, for example, the condition and 
weight of the patient, the extent of desired treatment and 
the tolerance of the patient for the treatment. 

Administration of the inhibitors of mammalian 
CAAX processing enzymes of this invention, including 
isolated and purified forms, their salts or 
pharmaceutical^ acceptable derivatives thereof, may be 
accomplished using any of the conventionally accepted 
mod es of administration of agents which are used to treat, 
for example, tumors and cancers. 

The pharmaceutical compositions of this 
invention may be in a variety of forms, which may be 
selected according to the preferred modes of 
administration. These include, for example, solid, semi- 
solid and liquid dosage forms such as tablets, pills, 
powders, liquid solutions or suspensions, suppositories, 
and injectable and infusible solutions. The preferred 
form depends on the intended mode of administration and 
therapeutic application. Modes of administration may 
include oral, parenteral, subcutaneous, intravenous, 
intralesional or topical administration. 

The inhibitors of this invention may, for 
example, be placed into sterile, isotonic formulations 
with or without cofactors which stimulate uptake or 
stability. The formulation is preferably liquid, or may 
be lyophilized powder. For example, the inhibitors may be 
diluted with a formulation buffer comprising 5.0 mg/ml 
30 citric acid monohydrate, 2.7 mg/ml trisodium citrate, 41 
mg /ml mannitol, 1 mg/ml glycine and 1 mg/ml polysorbate 
20 This solution can be lyophilized, stored under 
refrigeration and reconstituted prior to administration 
with sterile Water-For- Injection (USP) . 
35 The compositions also will preferably include^ 

.-- conventional pharmaceuticaliy acceptable carri«fs-^el-l 
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known in the art (see for example Remington's 
Pharmaceutical Sciences, 16th Edition, 1980, Mac 
Publishing Company) . Such pharmaceutical^ acceptable 
carriers may include other medicinal agents, carriers, 
5 genetic carriers, adjuvants, excipients, etc., such as 
human serum albumin or plasma preparations. The 
compositions are preferably in the form of a unit dose and 
will usually be administered one or more times a day. 
The pharmaceutical compositions of this 
10 invention may also be administered using microspheres, 
liposomes, other microparticulate delivery systems or 
sustained release formulations placed in, near, or 
otherwise in communication with affected tissues or the 
bloodstream. Suitable examples of sustained release 
15 carriers include semipermeable polymer matrices in the 
form of shaped articles such as suppositories or 
microcapsules. Implantable or microcapsular sustained 
release matrices include polylactides (U.S. Patent No. 
3,773,319; EP 58,481), copolymers of L-glutamic acid and 
20 gamma ethyl-L-glutamate (Sidman et al., BiQPQlynierS/ 22, 
pp. 547-56 (1985)); poly (2-hydroxyethyl-methacrylate) or 
ethylene vinyl acetate (Langer et al., J, BiQflied» Mater. 
Res . , 15, pp. 167-277 (1981); Langer, Chenu Tech./ 12, pp. 
98-105 (1982) ) . 
25 Liposomes containing inhibitors of CAAX 

processing enzymes may be prepared by well-known methods 
(See, e.g. DE 3,218,121; Epstein et al., PrQC. Natl. Acad» 

, U.S.A. . 82, pp. 3688-92 (1985); Hwang et al., PrOC. 
watl. Acad - Sri. U.S.A.. 77, pp. 4030-34 (1980); U.S. 
30 Patent Nos. 4,485,045 and 4,544,545). Ordinarily the 
liposomes are of the small (about 200-800 Angstroms) 
unilamellar type in which the lipid content is greater 
than about 30 mol.% cholesterol. The proportion of 
cholesterol is selected to control the optimal rate of MAG 
35 derivative and inhibitor release. 

— The inhibitors-of mammalian CAAX processing 
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enzymes of this invention may also be attached to 
liposomes, which may optionally contain other agents to 
aid in targeting or administration of the compositions to 
the desired treatment site. Attachment of inhibitors to 

5 liposomes may be accomplished by any known cross-linking 
agent such as heterobi functional cross-linking agents that 
have been widely used to couple toxins or chemotherapeutic 
agents to antibodies for targeted delivery. Conjugation 
to liposomes can also be accomplished using the 

10 carbohydrate-directed cross-linking reagent 4-(4- 

maleimidophenyl) butyric acid hydrazide (MPBH) (Duzgunes 
et al., |T r rV Riorum. Abst. Suppl. 16E 77 (1992)). 

The following are examples which illustrate the 
invention. These examples should not be construed as 
15 limiting: these examples are included for purposes of 
illustration only. 

Example 1: Cloning and Sequencing of the Human hRCB cDNA 

E solatisa of L2SZ22B xatn clones 

To isolate the human cDNA counterpart to the 

20 yeast RCE1 gene, we screened a Agtll cDNA library made 

from a 9-week old human fetus. The library (ATCC #77433) 
was purchased from the American Type Culture Collection. 
Approximately 6 x 10 5 phage were plated onto the host 
caii strain LE392 and lifted onto Biodyne nylon membranes 

25 (ICN) by standard methodology (see, e.g., Sambrook et al., 
Molecular Cloning - A Laboratory Manual [2- ed.] Vol. 1-3 
(1989) ) . 

A plasmid vector containing the human EST 
(ID#358628) with similarity to the yeast Rcelp protein was 
30 purchased from Research Genetics and served as a probe. 
The EST was recovered from the vector by digesting with 
EcoRI restriction enzyme and agarose gel purific ation. 
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The EST, approximately 750 base pairs in length, was 
32 

labeled with P by random priming and hybridized to the 
phage lifts at high stringency according to the nylon 
membrane manufacturers specifications at 65°C for 18 hours. 
5 The hybridization buffer consisted of 5X Denhardt' s 

solution, 5X SSC, 0.1% SDS, and 100 yg/ml denatured salmon 
sperm DNA. The filters were washed at high stringency at 
65°C in 0.1% SDS and 0.1% SSC and exposed to 
autoradiography . 
10 Two plaques, 13R and 27R, which showed a strong 

hybridization signal to the human EST (#358628) were 
purified to homogeneity by two successive rounds of 
plating and hybridization according to standard 
procedures . 

15 Phage stocks were prepared by overlaying plates 

of E. coli strain LE392 that had been infected with Agtll 

clones 13R and 27R with 20 mis SM. After gentle shaking 

at room temperature, approximately 12 mis was drawn off 

each plate, the cells were pelleted, and the stocks were 

20 stored at 4°C with a few drops of chloroform. The stocks 

3 

were titered with LE392 and determined to be 2.6x10 
pfu/ul (13R) and 1.6xl0 3 pfu/ul (27R) . 

To prepare DNA from these two clones, LE392 
cells (fresh overnight culture concentrated in 40% volume 
25 in SM) were infected with 100,000 pfu of 13R and 27R. The 
infection was incubated for 30 minutes at 42°C, plated on 
large LB plates with top agar, and incubated at 42°C 
overnight. The plates were then overlayed with 15 mis X 
diluent and allowed to shake gently for 2 hours at room 
30 temperature. The X diluent was then drawn off and the 
cells pelleted out. DNA was then prepared following 
standard procedures (see, e.g., Sambrook et al., Molecular 
Cloning ~ A Laboratory Manual [2 nd ed.] Vol. 1-3 (1989)), 
and resuspended in TE. 

35~ PCR Ampli fi-ration 
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The 5' ends of the X clones were PCR amplified 
using two different internal primers (608RI, 5'- 
GGTTGCGTACAGACACTCCGTC-3' (SEQ IDNO:39); RCE 1R-RI, 5'- 
TTATTATACCACTTTTACCCG- 3 ' (SEQ ID NO: 40)) and an external X 
5 primer (OMS178, 5' -TGGCGCAGGTAGCAGAG-3' (SEQ ID NO: 41)). 
using Vent DNA polymerase) . The PCR reactions were then 
treated with Taq DNA polymerase for 10 minutes at 72°C to 
produce A overhangs. The reactions were extracted with 
phenol/chloroform, precipitated with NaOAc/EtOH and 
10 resuspended in TE. The Invitrogen® Original TA Cloning® 
kit was used to clone the PCR products into P CR® 2.1 and 
transformed into INVaF' . Five clones containing inserts 
were identified: 13R/608-178-1, -3, 13R/RCE-178-3, 
27R/608-178-1, -3. These five clones were sequenced using 

15 the ABI 377 sequencer. 

The 3' ends of the A clones were subcloned by 
PCR amplifying the ends with an internal primer (8F, 5' - 
CTGCCTCACAGACATGCGTTG- 3 ' (SEQ ID NO: 42)) and an external X 
primer (OMS179, 5' -GCGAAATACGGGCAGACATGG-3' (SEQ IDNO:43)) 
20 using Tag DNA polymerase. The Invitrogen® Original TA 

Cloning® kit was used to clone the PCR products into P CR® 
2 1 and transformed into INVaF' . Three clones containing 
inserts were identified: 13R/8F-179-1, 27R/8F-179-1, -2. 
These three clones were sequenced using the ABI 377 
25 sequencer . 

Additional sequence information for the human 
hRC E cDNA was obtained by 5' and 3' RACE methods (Siebert 

et ai., mnn- — aa—aa* 23. PP . 1087-1088 (1995)). 

Adaptor-ligated cDNA constructed from total human fetus 
30 mRNA (#7438-1), human placenta (#7411-1) and murine 15-day 
embryo (#7459-1) were purchased from Clonetech (Palo Alto, 
CA) . Reactions were carried out under the recommended 
conditions of the manufacturer which included the 
following PCR conditions: IX KlenTaq PCR buffer 
35 (Clonetech), 200uM dNTPs, 200nM oligonucleotide primers, 
- io pg/ul cDNA, and 5 units of KlenTaq polymerase in a-50uir 
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reaction volume in the presence or absence of 5% DMSO. ... 
All PGR reactions included the primer API (Clonetech) 
which is complementary to the cDNA adapter sequence and a 
gene specific primer. PGR reactions with human cDNA 

5 included one of the gene specific primers hRCE(lR) (SEQ. ID 
NO: 7), hRCE (2R) (SEQ ID NO: 8), hRCE (10R) (SEQ ID NO: 16) or 
hRCE(15R) (SEQ ID NO: 21) for 5' RACE reactions and 
hRCE (3F) (SEQ ID NO: 9) or hRCE (4F) (SEQ ID NO: 10) for 3 f 
RACE reactions. 5' RACE reactions of murine cDNA included 

10 API (Clonetech) and one of the following gene specific 

primers: 608 (1R) (SEQ ID NO:27), 608 (2R) (SEQ ID NO:28) or 
hRCE(lOR) (SEQ ID NO: 16) . All PCR reactions were performed 
in an ABI 9600 thermocycler . 5' and 3' RACE products were 
resolved by agarose gel electrophoresis, excised and 

15 recovered. The DNA products were subcloned into the T/A 
cloning vector, pCR2.1 (Invitrogen f Carlsbad, CA) and 
sequenced. 

Example 2: Human hRCE RNA Analysis 

Northern Riot Analysis 

20 Two multiple tissue mRNA blots (#7760-1, #7759- 

1) were purchased from Clonetech and hybridized to the 

32 

human EST (ID#358628) uniformly labeled with P as 
described in Example 1. Hybridization and washing were 
done at high stringency at 65°C using the conditions 
25 recommended by Clonetech. 

p^i™gr Extension Analysis 

5 pg human poly (A) RNA from pancreas (Clonetech, 

32 

catalog # 6539-1) was hybridized to 0.9 pmol of P end- 
labeled primer (15R, 5' -AGGCTGCCCACGTAGGAGCAGGCGAGGCTGA-3' 
30 (SEQ ID NO: 21) 2.6xl0 6 cpm/pmol) in hybridization buffer 

_(2 mM-Tris [pH 7.6], 0.2 mM EDTA, 250 mM KC1) in a volume. 

of 10 jil. The reaction mixture was placed at 73°C for 2 
minutes and then at 53°C for 1 hour. Extension of the 
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primer was initiated by the addition of 40 pi of reverse 
transcriptase buffer (20 mM Tris (pH 8.7], 10 mM MgCl 2 , 
100 ug/ml actinomycin D, 5 mM dithiothreitol, 0.33 mM 
deoxynucleoside triphosphates) and 200 units 

5 superscripts I (GibcoBRL, catalog « 18064-014). The . 
extension reaction was placed at 37°C for 30 minutes and 
then terminated with the addition of 300 ul ethanol. The 
nucleic acids were pelleted and resuspended in 5 ul 
formamide loading buffer/0.1 M NaOH (2:1). The reaction 

10 products were then separated on an 8% polyacrylamide/urea 
sequencing gel, and visualized by autoradiography. The 
predominant extension product was approximately 41 

nucleotides long. 

To control for the possibility of secondary 
15 structures at the 5' end of the hRCEl mRNA affecting the 
primer extension analysis, the reactions were performed 
utilizing the GeneAmp® Thermostable rTth Reverse 
Transcriptase RNA PCR kit (Perkin Elmer, catalog # N808- 
0069). Human Poly (A) RNA (5 ug) from pancreas (Clontech, 
20 catalog # 6539-1) and placenta (Clontech, catalog # 6518- 
1) were separately hybridized to 0.9 pmol of P end- 
labeled primer (15R, 2.6X10 6 cpm/pmol (SEQ ID 110:21) ) in 
rTth reverse transcriptase buffer (10 mM Tris [pH 8.3], 90 
mM KC1) in a volume of 14.4 ul. The hybridization was 
25 carried out at 70°C for 15 minutes. 5.6 ul extension mix 
(final concentrations: 1 mM MnCl 2 , 1 mM dATP, 1 mM dCTP, 1 
mM dGTP, 1 mM dTTP, 5 units rTth DNA polymerase) was added 
to the hybridization reactions and incubated at 70°C for 20 
minutes. The reactions were terminated with the addition 
30 of 120 ul ethanol and the nucleic acids pelleted. The 
products were resuspended in 5 ul formamide loading 
buffer/0.1 M NaOH (2:1). In order to size the primer 
extension products double stranded sequencing reactions 
were performed using 15R (SEQ ID NO: 21) as the primer and 
35 a subclone of the A clone 27R (27R/608-178-1, see ab ove) ■ 
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The sequencing reactions were performed utilizing the 
Sequenase Version 2.0 DNA Sequencing Kit (United States 
Biochemical/ catalog # US70770) as per the kit 
instructions. The primer extension products and the 
5 sequencing reactions were resolved on an 8% 

polyacrylamide/urea sequencing gel followed by 
autoradiography • 

Example 3: Expression of the human hRCE cDNA in 

10 a yeast mutant 

The ORF beginning at position 116 of the hRCE 
gene (SEQ ID NO:l, nucleotides 116 to 997; Figure 2A) was 
expressed in a yeast strain deleted for both CAAX protease 
genes, yAFCl and yRCEl (Boyartchuk et al., supra ) (Example 

15 3) . To create the expression construct, we amplified by 
PCR the human hRCE ORF from the first valine (defined by 
nucleotides 116-118) to the stop codon (defined by 
nucleotides 995-997) employing the human hRCE cDNA A clone 
27R as a template. The primers included in the reaction 

20 were hRCE-ValX (SEQ ID NO:22) and hRCE3P-RI (SEQ ID 
NO: 23). The primer hRCE-ValX incorporated an Xbal 
restriction site immediately upstream (5') of the GTG 
valine codon (defined by SEQ ID NO:l, nucleotides 116- 
118) . The primer hRCE3P-RI incorporated an EcoRI site 

25 immediately downstream (3') of the TGA stop codon (defined 
by SEQ ID NO:l, nucleotides 995-997) . 

Following amplification, the PCR product was 
digested with EcoRI and Xbal and subcloned into the 
respective sites of the expression vector, pACAl' or 

30 pACA5' . pACAl' contains 1000 base-pairs of the yeast MEV1 
promoter including the translational initiation codon ATG. 
The MEV1 promoter was amplified by PCR from yeast genomic 
DNXlising the following primers: 

MEV1-1 (containing Hindlll site) : 5'-GGA AAA GCT TTT 

.35 GAA GCA a^JSTG^GAG-J 1 - 3 ' _(SEQ ID NO:44); and 
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MEV1-2 (containing XJbal site) : 5'-GTA AAC GTT CTA 
GAC ATT TTG AGT ACG TC-3' (SEQ ID NO: 45). 

Vector pACA5' contains 700 base-pairs of the 
yeast ERG12 promoter and the first four codons of the 
5 ERC12 coding sequence. The ERG12 promoter was amplified 
by PCR from yeast genomic DNA using the following primers: 

ERG12-1 (containing SphI site) : S'-GGC CGG GCA TGC 
AAC CTA TTC GAG ATG AAT GCG-3' (SEQ ID NO: 46) ; and 

ERG12-2 (containing XJbal site): 5'-CCG GCC TCT AGA 
10 CGG TAA TGA CAT ATT GAC AGT- 3 ' (SEQ ID NO: 47) . 

Both expression vectors include an EcoRI-Narl 
fragment containing the yeast PGK1 transcriptional 
termination sequence. A fragment of DNA comprising the 
yeast PGK1 termination sequence was amplified by PCR from 
15 yeast genomic DNA using the following primers: 

PGK 1 (containing an EcoRI site) : 5'-GAT TGA ATT CAA 
TTG AAA TCG ATA G-3' (SEQ ID NO: 48); and 

PGK 2 (containing a Narl site): S'-CCG AGG CGC CGA 
ATT TTC GAG TTA T-3' (SEQ ID NO: 49) . 

The resulting plasmids, pACA7001 (alternatively 
referred to herein as pAB14) and P ACA7002 (alternatively 
referred to herein as pAB13) , represent translational 
fusions between the MEV1 ORF and the ERG12 ORF, 

respectively. 

The yeast strain JRY5317 (Boyartchuk et al., 
SUEXiD is an a-mating type strain in which both CAAX 
protease genes have been deleted. For transformation of 
the hRCE expression plasmids, JRY5317 was grown overnight 
in rich medium (2% Bacto Peptone, 1% Yeast Extract, 2% 
Glucose) . The transformation was performed by the lithium 
acetate method (Gietz and Schiestl, Xsasl, 7, pp. 253-263 
(1991)). Transformants were selected for uracil 
prototrophy on minimal medium. 

Plasmids pACA7001 and pACA7002 were each 
35 transformed into__the_yeast strain J RY5317, an a-mating 
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type strain in which both CAAX protease genes have been 
deleted (Boyartchuk et al., suei^.) (Example 3). Halo 
assays were performed by replica plating patches of the 
yeast transf ormants onto a lawn of a sst2 tester lawn as 
5 described (Boyartchuk et al., supra ) . Cells harboring an 
sst2 mutation are rendered supersensitive to mating 
pheromone and undergo a Gl arrest when exposed to 
pheromone. This creates a halo around the patch of mated 
yeast cells, 

10 Example 4: Expression of the human hAFC cDNA 

in a yeast mutant. 
The ORF described herein which encodes a homolog 
of the yeast AFC1 gene (SEQ ID NO: 37; Figure 8) is 
expressed in a yeast strain deleted for both CAAX protease 

15 genes, yAFCl and yRCEl (Boyartchuk et al., supra ) as 
described in Example 3. To create the expression 
construct, the human AFC ORF (SEQ ID NO: 37) is amplified 
from the first GCG valine codon to the stop codon at ca. 
nucleotide 721 (see Figure 7) and inserted into yeast 

20 expression vectors as described in Example 3. 

Example 5: A screening method for identifying 

inhibitors of mammalian CAAX processing 
enzymes . 

A surrogate organism that is easy to manipulate 
25 and maintain, such as Saccharomyces cerevisiae, may be 
transformed with an expression vector encoding hRCE or 
hAFC sequences. Such a recombinant organism may serve as 
a template for in vivo screening for inhibitors of the 
protease. 

30 Such an assay could be direct or indirect. A 

direct assay could be created based upon the cleavage of a 
CAAX-containing proteins such as ^-factor in a halo assay 
or mating assay- as described in (Boyartchuk et al.., 
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snora ) . Another assay could be based upon the requirement 
of CAAX proteolysis for Ras function. Such an assay would 
utilize a yeast strain possessing hRCE and an activated 
allele of Ras such as the Glyl9 to Vall9 mutation. Cells 

5 that harbor an activated allele of Ras are susceptible to 
acute heat shock at 55°C. Thus, one could screen for 
inhibitors of Ras by screening or selecting compounds that 
cause a heat-shock resistant phenotype. 

Alternatively, an assay for hRCE could be based 
10 upon an indirect effect of modulating the CAAX protease 
activity. For example, cells could be transformed with 
either a normal or non- functional version of hRCE. The 
expression of other selected genes in an organism could 
then be assessed. Once a gene was found whose expression 

15 was dependent upon the function of hRCE, that gene could 
then serve as the basis of an assay. 

Example 6: A screening method for identifying proteins 
that interact with mammalian CAAX processing 

enzymes 

20 A protein which physically interacts with a 

mammalian CAAX protease according to this invention may be 
isolated using the "two-hybrid system" (see, e.g., C.T. 
Chien et al., "The two-hybrid system: a method to identify 
and clone genes for proteins that interact with a protein 

25 of interest", Proc. Natl. Acad, Sci . U tS.ft.r 88(21), pp. 
9578-82 (1991)). This method takes advantage of the 
multi-domain structure of the yeast transcriptional 
activator protein GAL4, which has a DNA binding domain 
separable from a transcriptional activation domain. 

30 Expression plasmids are constructed to encode one of two 
hybrid proteins. One hybrid consists of the GAL4 
DNA-binding domain fused to a mammalian CAAX protease 
polypeptide of this invention. The second hybrid consists 
of the <SAL4 activation domain fused to protein sequ ences 
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encoded by a library of genomic DNA or cDNA fragments .from 
a mammalian or other cellular source. (The plasmids are 
co- trans formed into an appropriate yeast strain, or are 
individually transformed into yeast strains of opposite 
5 mating types having genetic markers that allow selection 
of diploids.) Interaction between the CAAX protease and a 
protein encoded by one of the library plasmids leads to 
transcriptional activation of a reporter gene of choice 
which contains an appropriately positioned GAL 4 binding 
10 site. The advantage of this method is that the gene which 
encodes the protein that interacts with the CAAX protease 
may be identified directly from the library. 

Example 7: Mutagenesis of the Histidine-Rich Region of 
hRCEl 

15 Each histidine comprising the histidine-rich 

sequence "HxHH" of the human Reel protein (SEQ ID NO: 2, 
amino acids 172-175) were changed individually to an 
alanine residue by site-directed mutagenesis, 

Plasmids pAB15, pAB16, and pAB17 were 

20 constructed as follows: An 888 bp Xbal-EcoRI fragment was 
cut from pAB13 (see pACA7002, Example 3) and cloned into 
pBluescript SK. Histidines 172, 174 and 175 were each 
mutated to an alanine using the QuikChange** Site-Directed 
Mutagenesis Kit following manufacturer's directions 

25 (Stratagene) . The following complementary pairs of 

oligonucleotides were used (underlined bases are those 
that were changed relative to the wild-type sequence (5'- 
CTC TTT TTT GGA GTT GCC CAT TTT CAC CAT ATT ATT GAG CAG 
CTG -3') (SEQ ID NO: 1 , nucleotides 611-655): 

30 For H172A 

hRCEl-Ml: 5' -CTC TTT TTT GGA GTT GCC Q£l TTT CAC CAT ATT 
ATT GAG CAG CTG- 3' 
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hRCEl-M2: 5'-CAG CTG CTC AAT AAT ATG GTG AAA M£ GGC AAC 
TCC AAA AAA GAG- 3' 

For H174A 

hRCEl-M3: 5' -CTC TTT TTT GGA GTT GCC CAT TTT £££ CAT ATT 
5 ATT GAG CAG CTG-3' 

hRCEl-M4: 5' -CAG CTG CTC AAT AAT ATG G2£ AAA ATG GGC AAC 
TCC AAA AAA GAG- 3' 

r^-r H175A 

hRCEl-M5 : 5' -GGA GTT GCC CAT TTT CAC fiEt ATT ATT GAG CAG 
10 CTG CGT TTC-3' 

hRCEl-M6: 5'-GAA ACG CAG CTG CTC AAT AAT . £££ GTG AAA ATG 
GGC AAC TCC- 3' 

Mutations were confirmed by sequencing on an ABI 
Prism 377 DNA Sequencer using the ABI Prism Dye Terminator 

15 Sequencing Kit with AmpliTaq DNA Polymerase, FS. The 888 
bp Xbal-EcoRI fragments from those clones with the desired 
mutations were then cloned back into P AB13 (replacing the 
corresponding wild-type hRCEl Xbal-EcoRI fragment) 
generating PAB15 (hRCEl-H172A) , P AB16 (hRCEl-H174A) and 

20 PAB17 (hRCEl-H175A) . 

Example 8: Whole Cell Co-Culture Assay for Identifying 
Rcelp Inhibitors 
A strain of yeast is constructed in which the 
endogenous RAS1 gene has been deleted and the endogenous 
25 RAS2 gene has been replaced with the temperature-sensitive 
ras2-23 allele, which sensitizes the strains such that 
their growth rates depend upon the function of Rcelp. 
Cells from this strain are transformed with a high-copy 
(2u) Plasmid (e.g., a LEU2, URA3 or HIS3-marked plasmid 
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whose presence can be selected in the appropriate leu, ura 
or his auxotrophic strain background) which either does 
(Strain 1) or does not (Strain 2) additionally encode: a) 
the RCE1 gene driven by a strong (e.g., the ADH1) 
5 promoter; and b) a marker gene (e.g., green fluorescent 
protein (GFP) ) under the control of a strong promoter. 
Strain 1 and Strain 2 transf ormants are selected on the 
appropriate medium (e.g., selecting for leu, ura or his 
prototrophy) . Strain 1 (but not Strain 2) transf ormants 
10 are also selected based on their fluorescence (from 
expression of the plasmid-linked GFP marker) . 

Strain 1 and Strain 2 are grown independently to 
a point where they are in mid-log phase (O.D.gQQ = 1.0) 
and are then mixed at an initial ratio of 1 cell of Strain 
15 1 for every 5,000 cells of Strain 2. The mixture does not 
exhibit significant fluorescence because Strain 1, which 
harbors the multi-copy plasmid encoding RCE1 and the 
fluorescent marker GFP/ is present in such low abundance. 
Following the initial mixture of the two strains, the co- 
20 culture is aliquoted into wells of 96-well plates. 

A typical 96-well plate will handle about 0.25ml 
of liquid. One OD 60 o of yeast in rich medium (YPD) is 
about 1 x 10 7 cells/ml. Taking into consideration that we 
want a 5,000-fold excess of Strain 2 in the mixture and we 
25 want to have at least 50 cells of Strain 1 in the mixture 
to take into account inevitable variations, then a co- 
culture experiment where the starting ODgQQ = 0.1 would 
include 50 cells of Strain 1 and 250,000 cells of Strain 
2. This gives a Strain 1-to-Strain 2 ratio of 1:5,000. A 
30 typical yeast overnight growth in a 96-well plate will 
produce an ODgQQ This translates to a 40-fold 

increase in growth, or 5.3 generations. 

To identify competitive inhibitors of Rcelp, a 

dilution series (e.g., ten-fold) of each selected 
35 potential inhibitor resuspended in an appropriate medium 
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is prepared and an aliquot of each test sample added 
individually to the wells of the 96-well plates which 
contain the co-cultured strains. As a control, an equal 
volume of each medium in which the test inhibitors are 
5 suspended (now without the test inhibitor) is added to a 
well containing the strain co-culture. The co-cultures 
plus test chemicals and controls are incubated until the 
cells have divided numerous times. The cells are diluted 
into fresh medium and allowed to undergo ca. 5 generation 
10 doublings (as determined by spectrographs absorbance 

readings) . At various points during this growth period, 
the relative abundance of Strain 1 in the mixture is 
assessed by fluorescence emission from the 96-well plates. 

Example 9: Heat Shock Assays for Measuring hRCE 

15 Function and for Identifying Rcelp 

Inhibitors 

Plaarcids 

Plasmid pABl was constructed as follows: The 
backbone plasmid for pABl was Yeplacl95 <2u, «RA3; Gietz 
20 and Sugino, 1988) and contains an insert consisting of two 
parts: green fluorescent protein (GFP) and 
phosphoglycerate kinase (PGK1) terminator. The coding 
region of GFP from Aeqvoria victoria containing the S65T 
mutation (Ormo et al., 1996) was amplified by PCR using 
25 oligonucleotides: 

5P-GFP-ORF (5 '-CAT GTC TAG AGG AGA AGA ACT TTT C-3») 

(SEQ ID NO: 51) ; and 

3P-GFP-ORF (5'-CGC GAA TTC CTA TTT GTA TAG TTC A-3'); 
and has Xbal and EcoRI sites engineered on the 5' and 3' 
30 ends, respectively. 

The PGK1 terminator was amplified by PCR using 
the following oligonucleotides: 

PGK 1 (containing an EcoRI site) : 5' -GAT TGA ATT CAA 
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TTG AAA TCG ATA G-3 f (SEQ ID NO: 48); and 

PGK 2 (containing a Narl site): 5 f ~CCG AGG CGC CGA 

ATT TTC GAG TTA T-3 f (SEQ ID NO: 49) and consists of 263 bp 

immediately downstream of the stop codon of PGK1. EcoRI 

5 and Narl sites were engineered on the 5 T and 3 1 ends of 

the PGK1 terminator. 

Plasmid pAB20 was constructed as follows: The 

dominant allele of RAS2, RAS2 vallS (D. Broek, et al., 

Cfiii, 41, pp. 763-769 (1985)), was cloned as a Xhol- 

10 Hindu I fragment (1906 bp) into the CEN-LEU2 plasmid 

pRS315 (R.S. Sikorski and P. Hieter, Genetics . 122, pp. 

19-27 (1989)). This plasmid carries the coding region for 
vall9 

Ras2p as well as 639 bp upstream and 298 bp 

downstream of the RAS2 open reading frame. 
15 Plasmids pAB13 hRCEl, pAB15 H172A, pAB16 H174A; 

and pAB17 H175A were made as described in Examples 3 and 
7. 

Heat Shock assays in veast rells 

Yeast cells wild-type for RCE1 (W303) (MAT&; 
leu2-3,112; ade2-101; his3-ll; trpl-1; ura3-l) were used 
to create an isogenic, double mutant strain lacking both 
the Rcelp and Afclp CAAX proteases {Arcel, Aafcl). The 
wild-type (W303) or Arcel, Aafcl strains were separately 
transformed with plasmid pAB20 expressing the dominant 
mutant RAS2 Va119 gene. Transformed Arcel, Aafcl cells 
then were co-transformed with one of the following URA3 
marked, high-copy (2u) plasmids: pABl (vector control); 
pAB13 (hRCEl), pAB15 (H172A) , pAB16 (H174A) ; and pAB17 
(H175A) (as described in Figure 10) . 

Yeast cell transf ormants were grown at 30°C for 
approximately 48 hours in synthetic complete media lacking 
uracil and leucine and supplemented with 2% glucose, to an 
approximate OD600 of 4-6. The cultures were diluted 1:10 
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in water in 0.5 ml PCR tubes and placed in a PTC-100 
thermocycler (MJ Research, Inc.). The heat-shock regimen 
was performed as follows: 25°C 3 min; 55°C 1 min, 25°C 3 
min. The cultures were further diluted in water and 
5 plated for viability. The number of surviving colonies 
were compared with those from parallel samples that had 
not undergone heat shock. The results of this experiment 
are shown in Figure 11. 

ffo^t .hnrv assays to ide p fi fy TAAX protease inhibitors 
10 The above heat shock assay may be modified by 

one of skill in the art to identify inhibitors of Rcelp in 
the presence of an activated RAS2 allele. A dilution 
series of each selected potential inhibitor may be tested 
(e.g., as described in Example 8). Rcelp inhibitors may 

15 be identified by their ability to increase in a 

concentration-dependent manner the viability of heat- 
shocked cells which express activated Ras2p, which express 
Rcelp from a chromosomal and/or from a single or multi- 
copy plasmid. In addition, the cells can be deleted for 

20 the yRCEl gene to identify inhibitors of the human hRcelp 
protein. 
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What is claimed is: 

!• An isolated, recombinant or synthetic DNA 

molecule which comprises a nucleic acid sequence 
selected from SEQ ID NO: 1, SEQ. ID NO: 3, SEQ ID NO: 
5 or SEQ ID NO: 37. 

2 - The DNA according to claim 1 wherein the DNA 
comprises a nucleic acid sequence of SEQ ID NO: 1. 

3 - An isolated, recombinant or synthetic DNA or 
polynucleotide comprising a nucleic acid sequence 
that is substantially homologous to any of the DNA 
molecules of claims 1 or 2. 

4- ^ isolated, recombinant or synthetic DNA or 
polynucleotide comprising a nucleic acid sequence 
that hybridizes under stringent conditions to any of 
the nucleic acid sequences of claims 1 or 2. 

5 - The DNA or polynucleotide according to any one 
of claims 3 or 4, wherein said polynucleotide is 
selected from the group consisting of RNA, cDNA, 
genomic DNA, synthetic nucleic acids, chemically or 
biochemically modified nucleic acid comprising 
non-natural or derivatized nucleic acid bases, or 
mixed polymers of any of the above. 

6 - An isolated, recombinant or synthetic DNA or 
polynucleotide comprising the antisense RNA or 
antisense DNA sequence of the DNA or polynucleotide 
according to any one of claims 1-5. 

7 - A DNA molecule according to any one of claims 
1-6, further comprising an expression control 
sequence operatively linked to the DNA or 
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polynucleotide sequence. 

8. A polynucleotide comprising at least eight 
consecutive nucleotides of the DMAs. of claims 1 or 2 
or the polynucleotides of claims 3-6. 

9. A vector comprising a DNA according to claim 1 
or 2 or a polynucleotide according to claims 3-6. 

10. A host cell transformed with a molecule 
according to any one of claims 1 to 9. 

11. The host cell according to claim 10, wherein the 
molecule is a DNA molecule which is integrated into 
the genome of said host cell. 

12. A method for producing a polypeptide comprising the 
step of culturing a host cell transformed with a DNA 
molecule according to claim 7. 

13. A polypeptide encoded by a DNA or polynucleotide 
comprising a nucleic acid sequence selected from SEQ 
ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 
37. 

14. A polypeptide encoded by a DNA or polynucleotide 
comprising a nucleic acid sequence selected from the 
group consisting of: 

(a) a nucleic acid sequence that is substantially 
homologous to any of the nucleic acid sequences 
of claims 1-5; and 

(b) a nucleic acid sequence that hybridizes under 
stringent conditons to any one of the nucleic 
acid sequences of claims 1-5. 
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selected from SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 
6 or SEQ ID NO: 38. 

16. A polypeptide characterized by being at least 
80% identical in amino acid sequence to any one of 
the amino acid sequences selected from SEQ ID NO: 2, 
SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 38. 

17. An antibody that binds specifically to an 
epitope of any one of the polypeptides of claims 13- 
15. 

18. A pharmaceutical composition comprising any one 
of the polypeptides of claims 13-15. 

19. A pharmaceutical composition comprising an 
inhibitor of a mammalian CAAX processing enzyme, said, 
enzyme comprising amino acid sequences selected from 
SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID 
NO: 38. 

20. A method for identifying a competitive inhibitor 
of any one of the polypeptides of claims 13-15, 
wherein said method comprises the steps of: 

a) co-culturing two distinguishable strains of 
cells which differ by the presence or absence of 
a functional RCE1 or AFC1 gene; and 

b) selecting compounds which increase the relative 
proportion of cells expressing the functional 
RCE1 or AFC1 gene compared to those which do 
not. 

21. A method for treating a CAAX-protein mediated 
disease or disorder in a patient by administering a 
pharmaceutical composition comprising any one of the 



SUBSTITUTE SHEET (RULE 26) 



WO 98/54333 



- 70 - 



PCTAJS98/I1415 



polypeptides of claims 13-15. 

22. A method for treating a CAAX-protein mediated 

disease or disorder in a patient by administering a 
pharmaceutical composition according to claim 19.. 
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INCBII:'- :-VXREFdb:-;|«|?h ? . . ; ;. [ .• : • . ■ . 7 - | 

Cross -referencing the Genetics of Model Organisms with Mammalian 

Phenotypep 

Results for TBLASTN Search vs. dbEST: 

Gene name(s): RCE1 Searched: 26 May 97 

GenBanfr Mapped Mapped 

NCBI ID: Accession: Org,: Pj^lue: N: Hiscorj*: (XREF) (Other) New? 

604642 W96411 Human 1 . 4 e - 1 9 4 79 No No 

Clone ID: 35862B 

Organism: Homo Sapiens 

Library: Soares fetal heart NbHH19W 

source : Wilson RK 

E-mail : est@watson.wustl.edu 

Phone : 314 286 1800 Fax: 314 286 1810 

To order this IMAGE Consortium clone, visit either the Genome Systems, 
Research Genetics, or ATCC World Wide Web pages. 

Minus Strand KSPs 

Score a 79 (37.3 bits), Expect « l.4e-19, Sum p(4) = l.4e-l9 
Identities « 14/28 (50%), Positives = 19/28 (67%), Frame = -3 

Query: 211 S I LLTTCFQ I LYTTLFGGLTKFVFVRTG 238 

♦I L+ FQ YT +FG T F+F+RTG 
Sbjct: 162 NI FLSAAFQFS YTAVFGAYTAFLFI RTG 79 

Score « 61 (28.8 bits), Expect a 1.4e-19, Sum P(4) a 1.4e-19 
Identities a 11/15 (73%) , Positives - 12/15 (80%) , Frame a -l 

Query: 188 LFFGLAHAHHAYEQL 202 

LFFG+AH HH EQL 
Sbjct: 230 LFFGVAHFHHIIEQL 186 

Score « 53 (25. 0 bits), Expect a 1.4e-19, Sum P(4) « 1.4e-19 
Identities a 9/19 (47%), Positives a 13/19 (68%), Frame = -3 

Query: 239 GNLWCCIILHALCNIMGFP 257 

G+L ++ H+ CN MGFP 
Sbjct: 81 GHLIGPVLCHSFCNYMGFP 25 

Score a 48 (22.7 bits), Expect a i.4e-19, Sum P(4) « l.4e-19 
Identities a 8/19 (42%), Positives « 14/19 (73%), Frame - -1 

Query: 147 RNFIFAPITEEIFYTSMLL 165 

RN ♦ AP+TEE+ + ♦ +L 
Sbjct: 338 RNQVI AP LTEELVFRACML 282 

FIG. 1A 
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gejj |7] - -/XREFdbxwfr. 
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Cross-referencing the Genetics of Model Organisms with Mammalian 

Pheno types 




GenBank 
Tjqsi IP : Accession: Org_: 



Mapped Mapped 
p value : N: Higcore: jjCREF} (Other) New? 



482520 Hlfiifll 



Human B.4e-15 



168 



No 



No 



Organ ism : 
Library : 
Source. : 
E-mail : 



284572 

Homo Sapiens 

Soares multiple sclerosis 2NbHMSP 
Wilson RK 

est®vatson . wuatl . edu 

314 286 1800 Fax.: 314 286 1810 



To order this IMAGE Consortium clone, visit either the Genome Systems. 
Research Genetics, or ATCC World wide Web pages. 

Plus Strand HSPs 

score - 168 (77.8 bits). Expect - 8.4e-15, P - 8-4e-l5 
Identities = 36/80 (45%). Positives - Sl/80 (63%). Frame = +2 

Query: 119 mlfndu-t^v^w ++gR E+0ADA+AKKLG ++L Al.IL OT, D 

SbjCt: 32 IITOFIFSPYNEVl^FCiTVWRRFEFQADAFAKKI^KACTLYSAliIKliNKDOTjGFPVSD 211 

Query: 179 PLYS S YHYSHPTLAERLTAL, ise 

L+S +HYSHP L ERL AL 
SbjCt: 212 WLFSMWHYSHPPLLERU3AL 271 



A human, mouse, and rat subset of dbEST is used for all "f^*^**'- 
Def ault BLAST parameters and SEG low complexity masking are used for each 

Search. 

Go sack 1 to the complete hit list for this query- 
Return | to your Query Summary. 

onlin. Help is available for this form, simply click any label 
for assistance. 



Xref < nf oaomail . bs . 1hu . edu 
bouglass E Bassett. Jr. (bassettancbi . nlm. nih . gov) 
http : / / www . ncbi . nlm ■ nih . qov/XREFdb/ 

FIG. 1B 
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Cross-referencing the Genetics of Model Organisms with Mammalian 

Pheno types 

Results for TBLASTN Search vs. dbEST: 



Gene name(s): RCE1 Searched: 26 May 97 



GenBank Mapped Mapped 

NCBI ID : Accession : Org : P value : N: Hiscore : (XREF) (Other) New? 

636296 AA022288 Mouse 1.6e-8 1 126 Ordered No NEW 

Clone ID: 456971 

Organism : Mus musculus 

Library : Scares mouse placenta 4NbMP13.5 14.5 

Source : Marra M/Mouse EST Project 

E-mail : mousees t@watson . wus t 1 . edu 

Phone : 314 286 1800 Fax: 314 286 1810 

To order this IMAGE Consortium clone, visit either the Genome Systems , 
Research Genetics , or ATCC World Wide Web pages. 

Plus Strand KSPs 



Score « 126 (59.2 bits), Expect « i.6e-08, P = i.6e-08 
Identities = 25/68 (36%), Positives - 41/68 (60%), Frame * +1 

Query: 4 3 QIRFKNFNLFIIAI^LFHFLEYYITAKYNPLK^ 102 

Q + +F +LSLFH+ EY +TA NP ♦ +SFLLN+ y A ♦ +S +B 

Sbjct: 316 QS S WNHFG WYVCS LS LFHYS E YL VTAVNNP KS LS LDS FLLNHS LE YTVAALS S W I EFTLE 495 

Query: 103 SFLFPDLK 110 

+ +P+LK 
Sbjct: 212 NIFWPELK 519 

This EST clone was selected for mapping by the XREF project based upon 
Similarity to the STE14/YDR410C/D9461. 1 gene (TBLASTN P-value « 1.6e-8) . 

Order authorized: 15 Jan 97 
Clone ordered: 18 Apr 97 

Clone received: 24 Apr 97 



A human, mouse, and rat subset of dbEST is used for all XREFdb EST 
searching. Default BLAST parameters and SEG low complexity masking are 
used for each search. 

Go Back | to the complete hit list for this query. 
Return I to your Query Summary. 

Onlin Help is available for this form, simply click any 
label for assistance. 



FIG. 1C 



SUBSTITUTE SHEET (RULE 26) 



PCT/US98/11415 

W.O 98/54333 

4/20 



8 



8 

CM 



8 



O 
O 



O 
O 
in 



O 4 CD CL 

o \ o < 

u I u «-> 

CD 1 CJ CD w 

O 4 LJ < 

CD 4 CJ CD 

1 < -J CD 

o i o cd 

1 CD < 

LJ CD O 

LJ lj 

o lj 

U U CD 

lj »- 

lj lj 



LJ 
LD 
CD 
O 
CD 
CD 
CD 



4 < _j 



LJ 

CD 
o 
cd 
o 
LJ 
o 
o 
lj 



o 
u 

cd <■ 

lj 

u 

o < 

u 



cd 
< 

(J I u 



CO 



< 
< 
cd 
LJ 

CD 

< 

< 
< 

o 

LD 



< 

o 
lj 

LJ 

o 
cd 
lj 
lj 
cd 

CJ 



i— UJ. lj 



lj 
cd 
cd 
cd 
cj 

CP CL 

o 
u 

cj cc 

CD 
CJ 

UJ 

CJ 

_ , o 

VJ 4 CD CL 



CD 




CD 4 


CJ 


< 




CD j 


o 




X 


< 4 




CD 




U 


CD 


O 






CD 


t— 


a 


CD \ 


CJ 


o 




< 




lj 




o 


CD 




cc 


LJ J 


CD 








< 


cd 




< " 




cd 


CL 


o 


CD 


CD 




CD 


CJ 


LJ 




CD 


CJ 


< 


—J 


< 




CD 




CJ ■ 


h CD 


»— 




<. 




»— 


UJ 


CJ 


CD 


LJ 




t— 


< 


CD 




CJ 




O 


co 


< • 


4- 











CD 4 


LJ 




4 


< 


_J 


CJ 1 


CD 




< \ 






LJ j 


CD 


CL. 


LJ 1 






LJ 


CD 




CD 


LJ 


CD 


CD 


CJ 




CD 


CJ 






- < 


-J 


LJ 


CD 




LJ 


CD 






f < 


Lw 


»— 








Y < 






< 




LJ 


CD 






1 < 




»— 


♦ < 




< ' 







LJ 
CD 
CD 



CJ * 



< 



a 

CJ 
CD 



LJ 
CD 
CJ 



< , 
LJ + CD 

< 4 »- O 
CD 
< 
CJ 



O 

o 

CD 



< 

LJ 

CD i LJ 
< 

O 4 CD 

CD 4 cj 
< 

LJ 
LJ 
LJ 
CD 

O | u 
CD 



CD CL 
CD 
CD 

CJ CD 



CJ 



o 

CD 



CD 
<t 
LJ + 
CD 
CD 
CD 



LJ 
LJ 
< 
CD 
< 
« ' LJ 
CD 



lj j 


CD 






a 4 


CJ 




CD 


CD 4 


O 


CC 


CJ 


CJ 


CD 




< 


CD 


CJ 


to 




U J 


CD 


LJ 




- < 




LJ 


< ] 










< 


> 


LJ 


CD ^ 


LJ 




CD 


CD 


LJ 






LJ ' 


■ CD 


to 


LJ 




■ < 




GC 


CD 


CJ 






\ < 


-J 


LJ 


CJ 


• tD 






CD " 


r cj 




LJ 




< 


—I 


LJ 


LJ 


CD 




CD 


< 






< 


CD 


1 ° 


cc 


LJ 


CJ 


T U 






CD 


4 LJ 




LJ 




T < 


-J 


LJ 



LJ \ CD 

CD 4 O 

CD + CJ CD 

CD t CJ 

t- 4 < 

< 4 a 

CD t LJ 

CD T LJ 



CD 
LJ 
LJ 
LJ 

CJ 
CD 



CD ♦ 


CJ 




CD \ 


CJ 




CD * 


LJ 


cc 


<x 4 






CD j 


LJ 




CD 


LJ 






< 




CJ j 


CD 






» < 


— » 


CJ 


CD 




CD 


LJ 




»— 


■ < 




CJ 


CD 




CD • 


r lj 






1 < 


> 


CD 


LJ 




CJ 


CD 




CD 


\ LJ 


LJ 




r < 





-I CD 



LJ \ CD 
LJ 
< 

CD | U 

< — J 
LJ 4 O 
CD + LJ 
►-]<—» 
»- | < 
LJ 



»- 4 <t 



LJ 
LJ 
U 
LJ 
CD 



CD 
CD 
CD 
CD 
LJ 



LJ 4 


CD 


t— 4 


< 


LJ \ 


CD 


LJ i 


CD 


CD ^ 


* LJ 


j 


< 


LJ 


CD 


CD J 


CJ 


CJ 


CD 


LJ ' 


• CD 


LJ 


CD 


CD 


LJ 


CD 


I < ~ > 


CD 


LJ 




r < 


CJ 


CD 


O 


1 & 




4 < 


LJ 


4 CD 


C5 


T ° 



LJ 


CD 




LJ 


CD 




CD ] 


CJ 


CD 


CD 


CJ 




CD - 


- LJ 






< 


31 


< 






LJ 


CD 




CD 


LJ 


LJ 




r < 




CD 


t CJ 




CC 


GG 


CL 


<i 


i *~~ 




LJ 


T 0 


< 


CD 


1 ° 




< 







u 

LJ 



CD 
CD 
CD 



CJ 
< 

< 4 



CO 



CD 
CD 
LJ 
CD 
CD 
CD 



CJ CD 
CJ 
CD 

CJ CD 
CJ 
CJ 



CD 
CD 
<t 
CD 

CJ CJ 

< 

CD 

CD <t 

CJ 

CD 

< -J 

■ - CD 
CD 

CJ CO 
^~ 
CD 

< -J 

CD 
CD 

CJ CJ 
< 
+ CD 

CD CO 
< 
CD 

< u_ 

CJ 

o 



O 4 cj 
< 

CJ 
CJ 
LJ 
LJ 
< 
LJ 



V- + <t 
O 4 CD 



CD 
CD 

CD CL 
CD 

CD CO 
i- 4 < 
CJ + CD 

t- 4 < -j 



CJ 
CD 
CJ 
CD 



CD 
LJ 

CD < 



LJ 4 CD 
LJ + CD 
CJ 4 CD < 
CD 
CD 



CJ 
CJ 



LJ 
LJ 
CJ 

LD , 

b_ 4 < 



CD 
CD 
CD 



CD 
< 

cj + 



>- 4 < -» 



LJ 4 


CD 




>— 


CD 


CJ 




CD 


LJ i 


CD 


< 


CD 


CD 


CJ 




CD 


CD 


CJ 






U \ 


CD 


< 


CJ 


CD 


CJ 




CD 


CD • 


- CJ 






CD 


CJ 


CD 


LJ 


CD 


1 CJ 




CD 


CJ 


f CD 






CJ 


CD 


CO 


CD 


H- « 


Y < 








< 




CJ 


< 


| ^* 


Z 


CD 


< 


I *~ 




CD 


CD « 


*- CJ 




CD 



CJ | 


CD 




»— 4 


< 




CD 4 


CJ 


CO 


< 7 






cj 4 


CD 


CO 


CJ j 


CD 




< 




S 1 


CJ 




^ H 


- < 


> 


CD 


CJ 




CD J 


CJ 






< 


> 


CD 


CJ 




CD • 


• CJ 






1 < 


—1 


CJ 


1 tD 




LJ 


CD 




H- 


< 


> 


CD - 


r ° 




CJ 


CD 


CO 


CD 


1 0 


< 






CJ 


t u 




LJ 


T ^ 




<t 


I *~ 





CD 4 


CJ 




LJ 4 


CD 


< 


CD A 


• CJ 




< 






LJ 


CD 


CL 


LJ 


CD 




CJ 1 


CD 






» < 


Lv 


< 


< 






< 






< 










CJ - 


r u 


CD 


CD 


CJ 


CD 


CJ 




CD 


CJ 




< 




LJ 


CD 


r 




CD 


4 cj 




h- 


4 < 


_J 


CJ 


4 CD 





cj 4 


CD 






«< 


> 


CD ] 


CJ 






•< 






- < 


> 


CD I 


CJ 




CD < 


CJ 




< < 






< ■ 






CD ' 


• CJ 




»— 


f «* 


•J 


LJ 


' CD 




CD 


\ CJ 




CD 


LJ 


CD 


CD • 


r lj 






4 < 




< 


4 H- 


Q 



CD 
CJ 

< z: 

<£ 4 I- 

4 < 

CD 4 CJ LJ 

I < 

CJ + LD 
CJ 4 CD < 
CD 
CD 
CD 
CJ 
CJ 



CJ 

CJ 

LJ CC 
CD 
CD 

>_ 4 < L- 
)- \ < 

CJ 

+ < 



CD 
CD 



CJ 
CJ 



CD 
CD 
< 
CJ 



CJ 
CJ 



CD 
<t 
CJ 

<c 

CJ 
CJ 
CJ 

< 

CD 
CJ 

+ < 

CD 
CJ 

< 



CD 
CJ 
CJ 
CJ 



CJ 4 CD 

>— 4 < 

t— 4 < 

CJ + CD 

CD 4 CJ 

CJ \ CD 



_ , CD 
t- 4 < Lu 

4 < 

H- U f C3 

CD 
CD 
CD 



CD 
LJ 
CD 



LJ 
CD 
CJ 



< »- « 
H- 

CD 

♦-+<-- 
-< 

CD 

< > 

CJ 
CJ 

CD < 
CJ 
CD 

CD CL 



I- 4 < 
< + »- 
CD 4 CJ 

4 < 



< 

CJ 



CJ CD 
CJ 
CJ 



CD 4 


CJ 


< \ 




cj H 


CD 


CD 


CJ 


CD J 


CJ 




< 


CJ 


CD 


CJ « 


- CD 


< 


. 


CD 


i CJ 


^~ 


\ < 


CD 


CJ 


»— « 


Y '< 


H- 


1 < 


CJ 


4 CD 



I 

< 

OJ 
LL 



U C3 
CJ 4 CD 



CD 



CD 

s ; 
§1 



CD 



CJ 
CJ 

< 

LJ 
< 
LJ 



< 


-J 


>— 


CD 




< 


CD 




CD 


CD 


H- 


CD 


»— 






CD 




< 


<t 


-J 




CD 




LJ 


CJ 




K- 


< 


_i 


CJ 


CD 




H- 


CD 




LJ 


CD 


CO 


CD 


< 




< 


>— 




LJ 


CD 




CD 








CD 




< 



CJ CJ 



CD 4 


LJ 




< 




UJ 


CD 


CJ 




CD 


LJ 




< 




UJ 


CD J 


- CJ 




< - 


»— 




CJ 


CD 


1— 


< 






CD 


LJ 




« 


r < 


-J 


CJ 


CD 




CD 


LJ 




CJ 


CD 


CL 


CJ 


4 CD 




CJ « 


T ^ 




CJ 


1 0 


< 



CJ 



CD CO 



CD 
CJ 



CD 4 CJ 

CJ f CD 

< + I- 
CD 4 CJ 

CD 4 cj 

< 4- »— O 
cj 4 CD 



CJ 

< 

CD 
CD 
CJ 
CD 



CJ 
CJ 
CD 
CJ 



SUBSTITUTE SHEET (RULE 26) 



WD 98/54333 



PCT/US98/11415 



5/20 



o 

O 



§ 

00 



o 
o 



O 
O 
O 



8 



§ 

CNJ 



CD 
LJ 

cd 

o 
cd 



CD 



(_> 
CD 
l_J 
«« 
CD 
CJ 

<: 
O 
< 

+ < 



CD 
CD 
CD 
CD 
J— 
CD 

CD 



O 4* CD 
CD 

CD 



CD 
LJ 



CD 
CJ 
CD 

CJ 
CD 
< 
CJ 
CD 
< 
CD 



< 

cj 

CJ 
CJ 
CD 



CD 
< 
CD 
CD 



CJ 
CD 
CJ 
CJ 
CJ 
CD 
>— 
CJ 

< 

CJ 



CO 



CD 

< — 



CD 



CJ 

LJ CD 



CD 
CJ 



CO 



CJ 

H- O 

CD 
CD 

cj cr 

CD 
CD 

<C Lk. 

<< 

< 

CJ cr 

CD 

CJ 

< -J 

CD 
CJ 

h- O 

CD 
CJ 

UJ 

CJ 

< 

•< — 



< — »- 



< • 

X 

CD 
CD 

>- X 

CD 

< 

< Lk. 
< 

»- X 

CD 

CD 

CD < 



CJ 
H- 

CJ CD 

CJ 

< 

<t Lu 

< 
< 

< Lu 
< 

CD 

< —J 

CD 
CJ 

CD CL 

CD 

CD 

CJ CJ 

< 

CD 

CD »- 



CJ 
CD 



< 

CJ 
CJ 
CD 



CD 
CD 
CJ 
CJ 
CD 
CD 
CD 



< 

CD 



< 

CJ 
< 
CD 
CD 
< 

< 

CJ 
CD 
O 
CJ 



CJ 

►— 

LJ 



CJ 
CD 



CJ 
< 

CJ 
CJ 
CD 

H- 

CD 
CD 
CJ 



CJ 

CD 
^- 
CJ 
CD 
< 
CJ 
< 
CJ 

< 
>- 

CJ 
CJ 



CD 



LJ 
CJ 



CD 
CJ 

< 

CD 

< u. 
< 

CD 

CD CO 



CJ 



CJ 



— . CJ 



CD 
CD 
CJ CJ 

< 

CD 

< -J 
CD 

< 

< > 

U 
CJ 

CD CL 

CD 

CJ 

CJ CD 
CJ 

< 

< — 



CD 
CD 

H- X 

CD 
»— 

CJ CD 
CJ 

H- 

CD »— 
CD 

CJ cr 

CD 

CD 

< — 

CD 

< U- 
< 

CD 

< -J 

CD 
CD 

< U. 
< 

< 

CD < 
CJ 
< 
CD 

CD 

< 

CD 

CD < 

CJ 

< 

CJ CD 

CJ 

CD 

< Lu 
< 

CD 

< >- 
CJ 

< 

CD < 

CJ 

H- 

CD >- 



CD 



< 

CD 
CD 
< 
CD 

< Lu 
< 
CJ 

h- o 

CD 
CD 

< Lu 



CD 
CD 
CD 

CD 
h- 
CD 
CD 
CD 

CJ 
CJ 
LJ 
CD 



CD 
CD 
<t 
CJ 
CD 
CD 

CJ 
CD 
*— 
CJ 
CJ 
CJ 
CJ 
CD 
CD 
CJ 
CD 
CD 

<s. 

CD 



< 
CJ 
CJ 



CD 
«t 
CD 
CD 



CD 
CD 
CJ 
CD 
CJ 
CD 



CD 



CO H- 

CD 
CD 
CD 



CJ 

< 



< _I 

CD 
< 

< —i 
CD 

CJ 

< -J 

CD 
CD 

< Lu 

< . 
CD 

< _J 

CD 
♦— 

CJ CD 

CJ 

CJ 

CJ 

< 

CJ CD 

CJ 

CJ 

< —I 

CD 
CD 

CD <t 

CJ 

< 

< 

CD 

CJ CD 
CJ 

CD <* 

CJ 

CJ 

< —J 
CD 

CJ 

< _J 
CD 

CD 

CD CL. 

CD 

CJ 

CJ cc 

CD 

CJ 

cj cr 

CJ 

>— o 

CD 
^- 

CD CL. 

CD 
CD 

X 

CD 
CJ 

>— LU 

CJ 

CJ 

< -J 

< 

CD 

CD «t 

CJ 

CJ 

CD <t 

CJ 

CD 

CJ CJ 

< 
< 

< > 

CJ 

< 

CD <t 
CJ 

CD CL 
CD 

CD 

< Lu 
< 

< 

CJ CD 

CJ 

CJ 

< c 



CD 



< 
< 



CJ 
CJ 

< 

CD 



CJ 
CD 



CJ 
LJ 
CJ 
CJ 
>— 
CJ 
CD 
CD 
< 
CD 



CJ 
< 
CD 
CD 
CD 
CD 
< 
CJ 
CD 
CD 
CD 
LJ 
CD 
< 
CD 
CD 



CJ 
CD 



CD 
CD 



CJ 
CJ 
CJ 
CJ 



CD 

<: 
o 

CD 
CD 
CJ 

< 



CD 
< 
< 
CJ 
CJ 
CJ 
CJ 

< 

CD 
CD 
CJ 
< 
CJ 

CJ 
CJ 
CJ 
CJ 
CD 
< 
CJ 
CJ 

CJ 
CD 



< 

CD CL 
CD 
»— 
CJ 

< 

CD 

CD CO 

< 

CD 

CJ CJ 

< 

CJ 

< -J 

CD 
CD 

CD CL 

CD 

< 

CD < 

CJ 

CJ 

LU 

CJ 
I— 

CD CO 

< 

CD 

h- Q 

CJ 
CJ 

CJ CD 



cd <: 

CJ 
CJ 

cj cr 

CD 
CJ 

*— LU 

CJ 
CJ 

< -J 
< 

< 

< _J 

CD 
CJ 

<t > 
CJ 

< 

CJ CJ 

< 

< _J 

CD 
CD 

CD Ql 

CD 

< 

< -J 

CD 
CD 

CJ CO 

I— 

CD 

CJ CD 

CJ 

CD 

»— >- 
<t 

CD 

< -J 

CD 
CJ 

CD 

CD CL 

CD 

CD 

H- O 

CJ 
CJ 

CD »- 
CD 

< -J 

CD 
CD 

CD CL 

CD 

CJ 

»- o 

CD 
CD 

<t -J 

CD 

CJ 









LJ - " 


CD 




< 






< 






CD 


CJ 


CD 




< 




CD 


CJ 






< 


Lk. 


<t 








< 




CJ • 


CD 


CO 


CD « 


CJ 






<t 




I— 


< 


> 


CJ 


CD 




CD 


CJ 






<t 


~ 


CD 


CJ 




< ' 


H- 




CD 


CJ 


CD 


CD - 


' CJ 




CD - 


CJ 




< 


H- 


LU 


< - 


♦— 




CD 


CJ 




< 


»— 


^ 


CJ 


CD 






»— 




CJ 


CD 


CL 


CD 


LJ 




CJ ■ 


CD 




H- « 


< 


_J 


CJ 


CD 




H— ■ 


< 






< 


> 




< 




CD 


CJ 






< 


_J 


CD 


LJ 




CJ 


CD 




CD - 


• CJ 


CD 


CJ - 


• CD 




CD 


CJ 




CJ 


CD 


CL 


»— 


< 




CJ 


CD 




CJ 


CD 


< 


CD 


UJ 




CD 


LJ 




< ' 


h— 


a 


CD • 


LJ 




CJ ' 


CD 




»— ■ 




_J 


CD 






LJ 


CD 




■ 


<: 


> 


CD 


LJ 




CD 


cj 




CD 


i_) 


CD 


CD 


(_j 




CD • 


• CJ 




CD ■ 


* CJ 


<r 


< ' 












< ' 


l_ 


UJ 


< 


i, 




CD 


LJ 




< 




a 


CD 


LJ 




CJ 


CD 




CD ■ 


LJ 


CD 


CJ « 


CD 




CD 


LJ 






< 


_ 


<t 


»— 




<t 






CJ 


CD 


< 








CD 


LJ 




< 




Q 


H- - 


- <t 




CD « 


" CJ 




CJ 


CD 




< ■ 












CJ 


CD 


<t 




< 




CD 


LJ 




CD 


LJ 


CD 


< 






CD « 


• LJ 




< « 




UJ 


CD 


CJ 




CD 


CJ 




CD 


CJ 


cr 








< ■ 






< 






< 






<; ■ 






CJ - 


• CD 


CL 


CD - 


r cj 




CJ 


CD 




< 


»— 


LU 


CJ 


CD 




CD 


CJ 




CJ 


CD 


CL 




»— 




CJ 


CD 




< 


>— 


o 


CJ • 


• CD 




CJ • 


• CD 






<. 


-J 


CJ 


CD 




CJ 


CD 




<t • 




Q 


CJ 


CD 




CD 


CJ 




CJ 


CD 


< 


CD 


CJ 




CD - 


- CJ 




CD - 


- o 


cr 








<t 






CJ 


CD 


CL 


CJ 


CD 




CJ 


CD 




CJ 


CD 


CL 


CJ 


CD 




CJ ■ 


CD 




CJ ' 


• CD 


CO 


« 


■ <. 




*•— 


*t 




CD 1 


CJ 


cr 


CJ 


CD 




CJ 


CD 




CD 


CJ 


CD 


CD 


CJ 




CD 


CJ 




y— 


<l 


—i 


CJ " 


■ CD 






. <j; 




CJ 


CD 


H* 


CD 


CJ 




< 






< 






CJ ' 


CD 






< 




« 




—J 


CJ 


CD 




CJ - 


• CD 




CD ' 


' CJ 


co 




< 




< 






LJ 


CD 




CD 


CJ 




< 


K— 




< 




o 




h- 




CJ 


CD 




CD - 


• CJ 




CJ • 


► CD 










CD 


CJ 


CO 


< 














< 


-J 


< 






CJ 


CD 




CD 


CJ 




CD < 


■ CJ 




>— 


> < 




CJ 


CD 






< 




< 






< 


»— 


o 




< 




CD 


CJ 




< 


*— 


o 


CD 


CJ 




CD - 


• CJ 




CD 


- CJ 


CD 


CD 


CJ 




CD 


CJ 




K- 


< 


mJ 


< 






CJ 


CD 






< 


> 


CJ 


CD 




a 


CJ 




»— 


• <. 


—J 




• < 




CJ 


CD 








Lu 


CD 


CJ 




»— 


< 




»— 


«t 


a: 




< 






*— 






< 





CM 
I 

< 

CM 

g 

LL 



SUBSTITUTE SHEET (RULE 26) 




SUBSTITUTE SHEET (RULE 26) 



WX> 98/54333 PCT/US98/11415 

7 /20 



ATGGCGGCGCTGGGCGGGGACGGGCTGCGTTTACTGTCGGTATCGCGGCCAGAGCGGCAGCCCGAGTCAACCGCGCTGAG 
» ' 1 1 I 1 I — ! 1 ! I 1 1 . ■ ■ | 

TACCGCCGCGACCCGCCCCTGCCCGACGCAAATGACAGCCATAGCGCCGGTCTCGCCGTCGGGCTCAGTTGGCGCGACTC 

MAALGGDGLRLLSVS RPEROPESTALS 
\ ■ ' ■ ' ■ ' — * — i ■ ■ 



CGGCCCGGGCTCGGGGCTGTGCTGCTGGGTGTCTGTGTTCTCCTGCTTCAGCCTCGCCTGCTCCTACGTGGGCAGCCTCT 
1 1 1 I ' I ' 1 1 ■ < h 

GCCGGGCCCGAGCCCCGACACGACGACCCACAGACACAAGAGGACGAAGTCGGAGCGGACGAGGATGCACCCGTCGGAGA 
GPGSGLCCWVSVFSCFSlACSYVGSL 



ACGTGTGGAAGAGCGAGCTGCCCAGGGACCACCCCGCTGTTATC AAGCGGCGTTCCACCAGTGTCCTGG7AGTGTCCAGC 
1 1 — 1 » I 1,1 I ■ 1 1 1 1 h 

TGCACACCTTCTCGCTCGACGGGTCCCTGGTGGGGCGACAATAGTTCGCCGCAAGGTGGTC ACAGGACCATCACAGGTCG 

YVWKSELPRDHPA'Vl KRRSTSVLVVSS 
■ 1 ■ 1 ■ ■ ■ — > ■ i , i ■ i i i 



TTGTCCCCTCTTTGCGTGCTGCTCTGGAGGGAACTCACTGGCATCCAGCCAGGCACATCACTGCTTACCTTGATGGGCTT 
1 1 1 ! ' 1 I 1 1 1 1 ■ ■ ■ I 

AACAGGGGAGAAACGCACGACGAGACCTCCCTTGAGTGACCGTAGGTCGGTCCGTGTAGTGACGAATGGAACTACCCGAA 

LSPLCVLLWRELTG I QPGT SLLTLMGF 
■ ' ■ — » ■ 1 ■ » ■ 1 1 ■ ■ . ,i 



CAGGCTGGAGGGCATTTTCCCAGCAGCGCTGCTGCCCCTGCTGCTAACTATGATCCTTTTCCTGGGTCCACTGATGCAGC 
■ ► 1 ' I ' 1 — — 1 I 1 1 ■ I I 

GTCCGACCTCCCGTAAAAGGGTCGTCGCGACGACGGGGACGACGATTGATACTAGGAAAAGGACCCAGGTGACTACGTCG 

RLEG I FPAALLPLLLTH I LFLGPLMQ 

' ■ 4 1 ■ 1 i * - - - ■ 



TCTCTATGGATTGCCCTTGTGACCTGACAGATGGGCTGAAGGTTGTCCTGGCCCCTCGTTCTTGGGCCCGCTGCCTCACA 

1 I ' 1 ' 1 H- 1 1 1 ■ \ 1 1 ■ | 

AGAGATACCTAACGGGAACACTGGACTGTCTACCCGACTTCCAACAGGACCGGGGAGCAAGAACCCGGGCGACGGAGTGT 

LSMDCPCOLTDGLKVVLAPRSWARCLT 
■ » ■ ■ — ■ » i ■ i ■ i - 
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gacatgcgctgg ctacgaaaccaagttattgcaccgctgacagaggagctggtgttccggg:ttgcatgctgcccatgct 

CTGTACGCGACCGATGCTTTGGTTCAATAACGTGGCGACTGTCTCCTCGACCACAAGGCCCGAACGTACGACGGGTACGA 
0 M R W L R N 0 V I A P L T E E L V F R A C H L P H L 



\GCGCCGTGCACGGGTCTGCGCCCTGCTGTGTTCACCTGCCCAC 



TCTTTTTTGGAGTCGCCCATTTTCACCACATTATTG 



TCGCGGCACGTGCCCAGACCCGGGACGACACAAGTGGACGGGTGAGAAAAAACCTCAGCGGGTAAAAGTGGTGTAATAAC 

A P C T G L G P A V F T C P L F F G V A H F H H ! 1 _ 
i ■■' ■ 1 1 1 ' 

agcagctgcgcttc cgccagagcagtgtgggaagtatcttcgtgtctgcagcgttccagttctcctacaccgctg-ttc 

TCGTCGACGCGAAGGCGGTCTCGTCACACCCTTC ATAGAAGCACAGACG'CGCAAGGTC AAGAGGATGTGGCGACA5AAG 

E Q L R F R 0 S S V G S 1 F V S A A F 0 / S 1 T A , F_ 

G6TGCTTATA CAGCTTTCCTCTTCATCCGCACAGGACACCTGATAGGGCCGGTTCTCTGCCACTCTTTCTGCAACTACAT 
"CCACGAATATGTCGAAAGGAGAAGTAGGCGTGTCCTGTGGACTATCCCGGCCAAGAGACGGTGAGAAAGACGTTGATGTA 

C A Y T A F L F I R T G H L I C P V L C H S F C M T H 

GGGCTT CCCTGCAGTGTGTGCAGCCCTGGAGCATCCACAGAAGTGGCCACTGCTGGCAGGCTATGCCCTCGGTST3G6AC 
CCCGAAGGGACGTCACACACGTCGGGACCTCGTAGGTGTCTTCACCGGTGACGACCGTCCGATACGGGAGCCACACCCTG 

gFPAVCAA LEHP 0 K W P L L A G Y A U G V G 



TTTTCCTGCTTCTGCTTCAACCCCTGACAGACCCCAAGCTCTATGGCAGCCTTCCTCTTTGTATGCTTTTGGAAAGAACA 

IaaaggacgaagacgaagtIggggactgtctggggttcgIgataccgtcggaaggagaaacatacgaaaacctttcttgt 

L F L L L L 0 P L T 0 P K L T C S U P L C H L L E R T 



GGGGCCTCAGAGACCCTACTGTGCTCCTGA 

i | ' 1 1 > 990 
CCCCGGAGTCTCTGGGATGACACGAGGACT 

G A S E T L L C S . 



FIG. 2B-2 
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